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ANGIOGENISIS ASSOCIATES PROTEINS, AND NUCLEIC ACIDS 
ENCODING THE SAME 



Related Applications 

This application claims priority to U.S. provisional application Serial No. 
60/191,134 filed\03/22/2000, which is incorporated herein by reference in its entirety. 

Background 

Cities have roads and alleys, plants have xylem and phloem, and people have 
arteries, veins and lymphatics. Without these byways, the vertebrate animal cells would 
starve or drown in their metabolic refuse. Not only do blood vessels deliver food and 
oxygen and carry away metabolic wastes, but they also transport signaling substances that 
apprise cells of situations remote to them but to which they need to respond. Hormonal 
messages are a common signal. 

All blood vessels are ensheathed by a basal lamina and a delicate monolayer of 
remarkably plastic endothelial cells lining the luminal walls. Depending on location and 
function, smooth muscle and connective tissue may also be present. 

Not only do healthy cells depend on the blood resources transported by the 
circulatory system, but so, too, unwanted cells: tumorigenic and malignant cells. These 
cells colonize and proliferate if they are able to divert blood resources to themselves. 
Angiogenesis, the type of blood vessel formation where new vessels emerge from the 
proliferation of preexisting vessels (Risau, 1995; Risau andFlamme, 1995), is exploited 
not only by usual processes, such as in wound healing or myocardial infarction repair, but 
also by tumors themselves and in cancers, diabetic retinopathy, macular degeneration, 
psoriasis, and rheumatoid arthritis. Regardless of the process, whether pathological or 
usual physiological, endothelial cells mediate angiogenesis in a multi-step fashion: (1) 
endothelia receive an extracellular cue, (2) the signaled cells breach the basal lamina 
sheath, abetted by proteases they secrete, (3) the cells then migrate to the signal and 
proliferate, and finally, (4) the cells form a tube, a morphogenic event (Alberts et ah, 
1994). The complexity of this process indicates complex changes in cellular physiology 
and morphology, gene expression, and signaling. Angiogenic accomplices that are cues 



include basic fibroblast growth factors (bFGF), angiopoietins (such as ANG1) and 
various forms of vascular endothelial growth factor (VEGF). 

The molecular events and the order in which they occur and the pathways that are 
required for this process are of fundamental importance to understand angiogenesis. In 
vitro models are useful for identifying alterations in gene expression that occur during 
angiogenesis. A particularly fruitful model systems involves the supspension in a three- 
dimensional type I collagen gel and various stimuli, such as phorbol myristate acetate 
(PMA), basic fibroblast growth factor (bFGF), and VEGF. The combination of the 
stimuli and the collagen gel results in the formation of a three-dimensional tubular 
network of endothelial cells with interconnecting lumenal structures. In this model, 
endothelial differentiation into tubelike structures is completely blocked by inhibitors of 
new mRNA or protein synthesis. Furthermore, the cells progress through differentiation 
in a coordinated and synchronized manner, thus optimizing the profile of gene expression 
(Kahn et ai, 2000; Yang et al, 1999). 

Tumor cells exploit angiogenesis to facilitate tumor growth. Controlling 
angiogenesis, by controlling the activity or expression of genes and proteins associated 
with angiogenesis, provides a way to prevent tumor growth, or even destoy tumors. 

Summary 

The invention is based in part upon the discovery of novel nucleic acid sequences 
encoding novel polypeptides. Nucleic acids encoding the polypeptides disclosed in the 
invention, and derivatives and fragments thereof, will hereinafter be collectively 
designated as "AAP" nucleic acid or polypeptide sequences. AAP, or angiogenesis 
associated polypeptides (AAP) comprises kelch-like polypeptide (KLP), human ortholog 
of mouse BAZF (hBAZF), hmt-elongation factor G (hEF-G), human ortholog of rat TRG 
(hTRG), human myosin X (hMXl) and its splice variant (hMX2), nuclear hormone 
receptor (NHR), and human mitochondrial protein (hMP). 

The invention is based in part upon the discovery of novel nucleic acid sequences 
encoding novel polypeptides. Nucleic acids encoding the polypeptides disclosed in the 
invention, and derivatives and fragments thereof, will hereinafter be collectively 
designated as "AAP" nucleic acid or polypeptide sequences." 
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In a first aspect, the present invention is an isolated polypeptide having at least 
80% sequence identity to the sequence SEQ ID NOS:2, 4, 6, 8, 10, 12, 14 or 16, 
polynucleotides encoding the same, and antibodies that specifically bind the same. 

In a second aspect, the present invention is an isolated polynucleotide having at 
5 least 80% sequence identity to the sequence SEQ ID NOS: 1, 3, 5, 7, 9, 1 1, 13 or 15, or a 

complement thereof. 

In a third aspect, the present invention is a transgenic non-human animal, having a 
disrupted AAP gene or a transgenic non-human animal expressing an exogenous 
polynucleotide having at least 80% sequence identity to the sequence SEQ ID NOS:l, 3, 
0 5, 7, 9, 1 1, 13 or 15, or a complement of said polynucleotide. 

In a fourth aspect, the present invention is a method of screening a sample for a 
mutation in an AAP gene. 

In a fifth aspect, the present invention is a method of modulating angiogenesis 
comprising modulating the activity of at least one AAP polypeptide. 
5 1" \ ■ In a sixth aspect, the present invention is a method of increasing, as well as 

^ - decreasing angiogenesis, comprising modulating the activity of at least one AAP 
polypeptide. Activity modulation of AAP polypeptides may be over-expressing or 
eliminating expression of the gene, or impairing a AAP polypeptide's function by contact 
with specific antagonists or agonists, such as antibodies or aptamers. 
) In a seventh aspect, the present invention is a method of treating various 

pathologies, including tumors, cancers, myocardial infarctions and the like. 

In an eighth aspect, the present invention is a method of measuring a AAP 
transcriptional and translational up-regulation or down-regulation activity of a compound. 
In a ninth aspect, the invention is a method of screening a tissue sample for 
> tumorigenic potential. 

In a tenth aspect, the invention is a method of determining the clinical stage of 
tumor which compares the expression of at least one AAP in a sample with expression of 
said at least one gene in control samples. 

Although methods and materials similar or equivalent to those described herein 
can be used in the practice or testing of the present invention, suitable methods and 
materials are described below. All publications, patent applications, patents, and other 
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references mentioned herein are incorporated by reference in their entirety. In the case of 
conflict, the present specification, including definitions, will control. In addition, the 
materials, methods, and examples are illustrative only and not intended to be limiting. 

5 Detailed description 

A model of angiogenesis-the suspension of endothelial cells in type I collagen gels 
with various stimuli-was used to identify a molecular fingerprint or transcriptional profile 
of endothelial differentiation into tubelike structures, using amplification and an imaging 
approach called GeneCalling (Shimkets et al, 1999). This method was previously shown 

0 to provide a comprehensive sampling of cDNA populations in conjunction with the 

sensitive detection of quantitative differences in mRNA abundance for both known and 
novel genes. Many differentially expressed cDNA fragments were found. The 
identification and differential expression of these gens was confirmed by a second 
independent method employing real-time quantitative polymerase chain reaction (PCR). 

5 Although some of the identified cDNA fragments were genes known to play some role in 

angiogenesis, many other differentially expressed genes were unexpected. The inventors 
have identified the unexpected genes and polypeptides that are expressed in response to 
this model of angiogenesis, collectively refered to as angiogenesis associated 
polypeptides (AAP). AAP are kelch-like polypeptide (KLP), human ortholog of mouse 

) BAZF (hBAZF), hmt-elongation factor G (hEF-G), human ortholog of rat TRG (hTRG), 

human myosin X (hMXl) and its splice variant (hMX2), nuclear hormone receptor 
(NHR), and human mitochondrial protein (hMP). 

Definitions 

► Unless defined otherwise, all technical and scientific terms have the same 

meaning as is commonly understood by one of skill in the art to which this invention 
belongs. The definitions below are presented for clarity. All patents and publications 
referred to herein are, unless noted otherwise, incorporated by reference in their entirety. 

The recommendations of (Demerec et al. y 1966) where these are relevant to 
genetics are adapted herein. To distinguish between genes (and related nucleic acids) and 
the proteins that they encode, the abbreviations for genes are indicated by italicized (or 
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underlined) text while abbreviations for the proteins start with a capital letter and are not 
italicized. Thus, AAP or AAP r efers to the nucleotide sequence that encodes AAP. 

"Isolated," when referred to a molecule, refers to a molecule that has been 
identified and separated and/or recovered from a component of its natural environment. 
5 Contaminant components of its natural environment are materials that interfere with 

diagnostic or therapeutic use. 

"Container" is used broadly to mean any receptacle for holding material or 
reagent. Containers may be fabricated of glass, plastic, ceramic, metal, or any other 
}'~ material that can hold reagents. Acceptable materials will not react adversely with the 
0 contents. 

'= 1. Nucleic acid-related definitions 

(a) control sequences 

Control sequences are DNA sequences that enable the expression of an operably- 
5 linked coding sequence in a particular host organism. Prokaryotic control sequences 

include promoters, operator sequences, and ribosome binding sites. Eukaryotic cells 
utilize promoters, polyadenylation signals, and enhancers. 

(b) operably-linked 

Nucleic acid is operably-linked when it is placed into a functional relationship 
0 with another nucleic acid sequence. For example, a promoter or enhancer is operably- 

linked to a coding sequence if it affects the transcription of the sequence, or a ribosome- 
binding site is operably-linked to a coding sequence if positioned to facilitate translation. 
Generally, "operably-linked" means that the DNA sequences being linked are contiguous, 
and, in the case of a secretory leader, contiguous and in reading phase. However, 
5 enhancers do not have to be contiguous. Linking is accomplished by conventional 

recombinant DNA methods. 

(c) isolated nucleic acids 

An isolated nucleic acid molecule is purified from the setting in which it is found 
in nature and is separated from at least one contaminant nucleic acid molecule. Isolated 
) AAP molecules are distinguished from the specific AAP molecule, as it exists in cells. 

However, an isolated AAP molecule includes AAP molecules contained in cells that 



ordinarily express an AAP where, for example, the nucleic acid molecule is in a 
chromosomal location different from that of natural cells. 

2. Protein-related definitions 

(a) purified polypeptide 

When the molecule is a purified polypeptide, the polypeptide will be purified (1) 
to obtain at least 15 residues of N-terminal or internal amino acid sequence using a 
sequenator, or (2) to homogeneity by SDS-PAGE under non-reducing or reducing 
conditions using Coomassie blue or silver stain. Isolated polypeptides include those 
expressed heterologously in genetically-engineered cells or expressed in vitro, since at 
least one component of an AAP natural environment will not be present. Ordinarily, 
isolated polypeptides are prepared by at least one purification step. 

(b) active polypeptide 

An active AAP or AAP fragment retains a biological and/or an immunological 
activity of the native or naturally-occurring AAP. Immunological activity refers to the 
ability to induce the production of an antibody against an antigenic epitope possessed by a 
native AAP; biological activity refers to a function, either inhibitory or stimulatory, 
caused by a native AAP that excludes immunological activity. A biological activity of 
AAP includes, for example, modulating angiogenesis. 

(c) Abs 

Antibody may be single anti-AAP monoclonal Abs (including agonist, antagonist, 
and neutralizing Abs), anti-AAP antibody compositions with polyepitopic specificity, 
single chain anti-AAP Abs, and fragments of anti-AAP Abs. A "monoclonal antibody" 
refers to an antibody obtained from a population of substantially homogeneous Abs, i.e., 
the individual Abs comprising the population are identical except for naturally-occurring 
mutations that may be present in minor amounts 

(d) epitope tags 

An epitope tagged polypeptide refers to a chimeric polypeptide fused to a "tag 
polypeptide". Such tags provide epitopes against which Abs can be made or are 
available, but do not interfere with polypeptide activity. To reduce anti-tag antibody 
reactivity with endogenous epitopes, the tag polypeptide is preferably unique. Suitable tag 
polypeptides generally have at least six amino acid residues and usually between about 8 
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and 50 amino acid residues, preferably between 8 and 20 amino acid residues). Examples 
of epitope tag sequences include HA from Influenza A virus and FLAG. 

The novel AAP of the invention include the nucleic acids whose sequences are 
provided in Tables 1, 3, 5, 7, 9, 11, 13 and 15, or a fragment thereof. The invention also 

5 includes a mutant or variant AAP, any of whose bases may be changed from the 

corresponding base shown in Tables 1, 3, 5, 7, 9, 11, 13 and 15 while still encoding a 
protein that maintains the activities and physiological functions of the AAP fragment, or a 
fragment of such a nucleic acid. The invention further includes nucleic acids whose 
sequences are complementary to those just described, including complementary nucleic 

0 acid fragments. The invention additionally includes nucleic acids or nucleic acid 

fragments, or complements thereto, whose structures include chemical modifications. 
Such modifications include, by way of nonlimiting example, modified bases, and nucleic 
acids whose sugar phosphate backbones are modified or derivatized. These modifications 
are carried out at least in part to enhance the chemical stability of the modified nucleic 

5 acid, such that they may be used, for example, as anti-sense binding nucleic acids in 

therapeutic applications in a subject. In the mutant or variant nucleic acids, and their 
complements, up to 20% or more of the bases may be so changed. 

The novel AAP of the invention include the protein fragments whose sequences 
are provided in Tables 2, 4, 6, 8, 10, 12, 14 and 16. The invention also includes an AAP 

0 mutant or variant protein, any of whose residues may be changed from the corresponding 

residue shown in Tables 2, 4, 6, 8, 10, 12, 14 and 16 while still encoding a protein that 
maintains its native activities and physiological functions, or a functional fragment 
thereof. In the mutant or variant AAP, up to 20% or more of the residues may be so 
changed. The invention further encompasses Abs and antibody fragments, such as F ab or 

> (F a b)'2, that bind immunospecifically to any of the AAP of the invention. 

The AAP nucleic acids are shown in Tables 1, 3, 5, 7, 9, 11, 13 and 15, and the 
corresponding polypeptides are shown in Tables 2, 4, 6, 8, 10, 12, 14 and 16, respectivly. 
Start and stop codons in the polynucleotide sequences are indicated in boldface and with 
underlining . SEQ ID NO:3 lacks a stop codon. The sequences of hMXl and hMX2 do 
not have start codons (see Table 17); consequently, hMXl and hMX2 polypeptides do not 
start with a Met. For any lacking polynucleotide sequence, one of skill in the art may 
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retrieve the full length sequence by, for example, probing cDNA or genomic libraries 
with probes designed according to the sequences of the instant invention. 



Table 1 KLP nucleotide sequence (SEQ ID NO: 1) 



ctggcctaga 


tactacaact 


gaactttttt 


tctttttagt 


tactccacag 


gatccgctga 


60 


acataggatg 


ttgccacaaa 


atctacctcg 


tgtatttttc 


tctttcactc 


atgagctgca 


120 


caattgcaga 


tttgagcaca 


atgtctgcag 


actgtgttga 


aaaactctga 


agaacctaat 


180 


taacacagga 


tgacctagga 


gtgattctaa 


gtctgtgtaa 


caagatatta 


ctcattagtg 


240 


aatgtgtcag 


tcttggtact 


gaatgctgca 


gataacagca 


agtaggttct 


cctttatttc 


300 


tgaagtattc 


acttgacctt 


ccatcagtaa 


gacggacttt 


tctaatctgt 


tcctggagat 


360 


attaatggaa 


tacagtcatg 


tccactcaag 


acgagaggca 


gatcaatact 


gaatatgctg 


420 


tgtcattgtt 


ggaacagttg 


aaactgtttt 


atgaacagca 


gttgtttact 


gacatagtgt 


480 


taattgttga 


gggcactgaa 


ttcccttgtc 


ataagatggt 


tcttgcaaca 


tgtagctctt 


540 


atttcagggc 


catgtttatg 


agtggactaa 


gtgaaagcaa 


acaaacccat 


gtacacctga 


600 


ggaatgtcga 


tgctgccacc 


ttacagataa 


taataactta 


tgcatacacg 


ggtaacttgg 


660 


caatgaatga 


cagcactgta 


gaacagcttt 


atgaaacagc 


ttgcttccta 


caggtagaag 


720 


atgtgttaca 


acgttgtcga 


gaatatttaa 


ttaaaaaaat 


aaatgcagag 


aattgtgtac 


780 


gattgttgag 


ttttgctgat 


ctcttcagtt 


gtgaggaatt 


aaaacagagt 


gctaaaagaa 


840 


tggtggagca 


caagttcact 


gctgtgtatc 


atcaggacgc 


gttcatgcag 


ctgtcacatg 


900 


acctactgat 


agatattctc 


agtagtgaca 


atttaaatgt 


agaaaaggaa 


gaaaccgttc 


960 


gagaagctgc 


tatgctgtgg 


ctagagtata 


acacagaatc 


acgatcccag 


tatttgtctt 


1020 


ctgttcttag 


ccaaatcaga 


attgatgcac 


tttcagaagt 


aacacagaga 


gcttggtttc 


1080 


aaggtctgcc 


acccaatgat 


aagtcagtgg 


tggttcaagg 


tctgtataag 


tccatgccca 


1140 


agtttttcaa 


accaagactt 


gggatgacta 


aagaggaaat 


gatgattttc 


attgaagcat 


1200 


cttcagaaaa 


tccttgtagt 


ctttactctt 


ctgtctgtta 


cagcccccaa 


gcagaaaaag 


1260 


tttacaagtt 


atgtagccca 


ccagctgatt 


tgcataaggt 


tgggaccgtt 


gtaactcctg 


1320 


ataatgatat 


ctacatagca 


gggggtcaag 


ttcctctgaa 


aaacacaaaa 


acaaatcaca 


1380 


gtaaaacaag 


caaacttcag 


actgccttca 


gaactgtgaa 


ttgcttttat 


tggtttgatg 


1440 


cacagcaaaa 


tacctggttt 


ccaaagaccc 


caatgctttt 


tgtccgcata 


aagccatctt 


1500 


tggtttgctg 


tgaaggctat 


atctatgcaa 


ttggaggaga 


tagcgtaggt 


ggagaactta 


1560 


atcggaggac 


cgtagaaaga 


tacgacactg 


agaaagatga 


gtggacgatg 


gtaagccctt 


1620 


taccttgtgc 


ttggcaatgg 


agtgcagcag 


ttgtggttca 


tgactgcatt 


tatgtgatga 


1680 


cactgaacct 


catgtactgt 


tattttccaa 


ggtctgactc 


atgggtagaa 


atggccatga 


1740 


gacagactag 


taggtccttt 


gcttcagctg 


cagcttttgg 


tgataaaatt 


ttctatattg 


1800 


gagggttgca 


tattgctacc 


aattccggca 


taagactccc 


ctctggcact 


gtagatgggt 


1860 


cttcagtaac 


tgtggaaatt 


tatgatgtga 


ataaaaatga 


gtggaaaatg 


gcagccaaca 


1920 


tccctgctaa 


gaggtactct 


gacccctgtg 


ttagagctgt 


tgtgatctca 


aattctctat 


1980 


gtgtgtttat 


gcgagaaacc 


cacttaaatg 


agcgagctaa 


atacgtcacc 


taccaatatg 


2040 


acctggaact 


tgaccggtgg 


tctctgcggc 


agcatatatc 


tgaacgtgta 


ctgtgggact 


2100 


tggggagaga 


ttttcgatgc 


actgtgggga 


aactctatcc 


atcctgcctt 


gaagagtctc 


2160 


catggaaacc 


accaacttat 


cttttttcaa 


cggatgggac 


agaagagttt 


gaactggatg 


2220 


gagaaatggt 


tgcactacca 


cctgtatagt 


ggggaagttc 


agggagtgca 


cgcctgagtt 


2280 
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atgtgctttg 


tcattttctt 


tgctaaacaa 


aagaggctat 


gaaagaacta 


aatatgagta 


2340 


cataaaattc 


tatctttgat 


aaattttatt 


tttatgccct 


acttaatatt 


tgcatcagta 


2400 


taatatatat 


cagtgagtct 


tacagaaaga 


tatgcttcca 


taatatgaaa 


tagattattc 


2460 


aataattgag 


aaactttatg 


tgtaatcatg 


agagtataag 


aatctggatt 


atctaacatt 


2520 


gttagccctg 


tgtatgtaca 


gttcaaaaag 


ttcatttata 


aaagtagttt 


cctgttc 


2577 



Table 2 KLP polypeptide sequence (SEQ ID NO: 2) 



Met 

1 


Ser 


Thr 


Gin 


Asp 
5 


Glu 


Arg 


Gin 


He 


Asn 
10 


Thr 


Glu 


Tyr 


Ala 


Val 
15 


Ser 


Leu 


Leu 


Glu 


Gin 
20 


Leu 


Lys 


Leu 


Phe 


Tyr 
25 


Glu 


Gin 


Gin 


Leu 


Phe 
30 


Thr 


Asp 


lie 


Val 


Leu 
35 


He 


Val 


Glu 


Gly 


Thr 
40 


Glu 


Phe 


Pro 


Cys 


His 
45 


Lys 


Met 


Val 


Leu 


Ala 
50 


Thr 


Cys 


Ser 


Ser 


Tyr 
55 


Phe 


Arg 


Ala 


Met 


Phe 
60 


Met 


Ser 


Gly 


Leu 


Ser 
65 


Glu 


Ser 


Lys 


Gin 


Thr 
70 


His 


Val 


His 


Leu 


Arg 
75 


Asn 


Val 


Asp 


Ala 


Ala 
80 


Thr 


Leu 


Gin 


He 


He 
85 


He 


Thr 


Tyr 


Ala 


Tyr 
90 


Thr 


Gly 


Asn 


Leu 


Ala 
95 


Met 


Asn 


Asp 


Ser 


Thr 
100 


Val 


Glu 


Gin 


Leu 


Tyr 
105 


Glu 


Thr 


Ala 


Cys 


Phe 
110 


Leu 


Gin 


Val 


Glu 


Asp 
115 


Val 


Leu 


Gin 


Arg 


Cys 
120 


Arg 


Glu 


Tyr 


Leu 


He 
125 


Lys 


Lys 


He 


Asn 


Ala 
130 


Glu 


Asn 


Cys 


Val 


Arg 
135 


Leu 


Leu 


Ser 


Phe 


Ala 
140 


Asp 


Leu 


Phe 


Ser 


Cys 
145 


Glu 


Glu 


Leu 


Lys 


Gin 
150 


Ser 


Ala 


Lys 


Arg 


Met 
155 


Val 


Glu 


His 


Lys 


Phe 
160 


Thr 


Ala 


Val 


Tyr 


His 
165 


Gin 


Asp 


Ala 


Phe 


Met 
170 


Gin 


Leu 


Ser 


His 


Asp 
175 


Leu 


Leu 


He 


Asp 


He 
180 


Leu 


Ser 


Ser 


Asp 


Asn 
185 


Leu 


Asn 


Val 


Glu 


Lys 
190 


Glu 


Glu 


Thr 


Val 


Arg 
195 


Glu 


Ala 


Ala 


Met 


Leu 
200 


Trp 


Leu 


Glu 


Tyr 


Asn 
205 


Thr 


Glu 


Ser 


Arg 


Ser 
210 


Gin 


Tyr 


Leu 


Ser 


Ser 
215 


Val 


Leu 


Ser 


Gin 


He 
220 


Arg 


He 


Asp 


Ala 


Leu 
225 


Ser 


Glu 


Val 


Thr 


Gin 
230 


Arg 


Ala 


Trp 


Phe 


Gin 
235 


Gly 


Leu 


Pro 


Pro 


Asn 
240 


Asp 


Lys 


Ser 


Val 


Val 
245 


Val 


Gin 


Gly 


Leu 


Tyr 
250 


Lys 


Ser 


Met 


Pro 


Lys 
255 


Phe 


Phe 


Lys 


Pro 


Arg 
260 


Leu 


Gly 


Met 


Thr 


Lys 
265 


Glu 


Glu 


Met 


Met 


He 
270 


Phe 


He 
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Glu 


Ala 


Ser 
275 


Ser 


Glu 


Asn 


Pro 


Cys 
280 


Ser 


Leu 


Tyr 


Ser 


Ser 
285 


Val 


Cys 


Tyr 


Ser 


Pro 
290 


Gin 


Ala 


Glu 


Lys 


Val 
295 


Tyr 


Lys 


Leu 


Cys 


Ser 
300 


Pro 


Pro 


Ala 


Asp 


Leu 
305 


His 


Lys 


Val 


Gly 


Thr 
310 


Val 


Val 


Thr 


Pro 


Asp 
315 


Asn 


Asp 


He 


Tyr 


He 
320 


Ala 


Gly 


Gly 


Gin 


Val 
325 


Pro 


Leu 


Lys 


Asn 


Thr 
330 


Lys 


Thr 


Asn 


His 


Ser 
335 


Lys 


Thr 


Ser 


Lys 


Leu 
340 


Gin 


Thr 


Ala 


Phe 


Arg 
345 


Thr 


Val 


Asn 


Cys 


Phe 
350 


Tyr 


Trp 


Phe 


Asp 


Ala 
355 


Gin 


Gin 


Asn 


Thr 


Trp 
360 


Phe 


Pro 


Lys 


Thr 


Pro 
365 


Met 


Leu 


Phe 


Val 


Arg 
370 


He 


Lys 


Pro 


Ser 


Leu 
375 


Val 


Cys 


Cys 


Glu 


Gly 
380 


Tyr 


He 


Tyr 


Ala 


He Gly 
385 


Gly 


Asp 


Ser 


Val 
390 


Gly 


Gly 


Glu 


Leu 


Asn 
395 


Arg 


Arg 


Thr 


Val 


Glu 
400 






Asp 


Thr 


Glu 
405 


Lys 


Asp 


Glu 


Trp 


Thr 
410 


Met 


Val 


Ser 


Pro 


Leu 
415 


Pro 






Trp 


Gin 
420 


Trp 


Ser 


Ala 


Ala 


Val 
425 


Val 


Val 


His 


Asp 


Cys 
430 


He 


Tyr 


Val 


Met 


Thr 
435 




Asn 


Leu 


Met 


Tyr 
440 


Cys 


Tyr 


Phe 


Pro 


Arg 
445 


Ser 


Asp 


Ser 


Trp 


Val 
450 


Glu 


Met 


Ala 


Met 


Arg 
455 


Gin 


Thr 


Ser 


Arg 


Ser 
460 


Phe 


Ala 


Ser 


Ala 


Ala 
465 


Ala 


Phe 


Gly 


Asp 


Lys 
470 


He 


Phe 


Tyr 


He 


Gly 
475 


Gly 


Leu 


His 


He 


Ala 
480 


Thr 


Asn 


Ser 


Gly 


He 
485 


Arg 


Leu 


Pro 


Ser 


Gly 
490 


Thr 


Val 


Asp 


Gly 


Ser 
495 


Ser 


Val 


Thr 


Val 


Glu 
500 


He 


Tyr 


Asp 


Val 


Asn 
505 


Lys 


Asn 


Glu 


Trp 


Lys 
510 


Met 


Ala 


Ala 


Asn 


He 
515 


Pro 


Ala 


Lys 


Arg 


Tyr 
520 


Ser 


Asp 


Pro 


Cys 


Val 
525 


Arg 


Ala 


Val 


Val 


lie 
530 


Ser 




Ser 




Cys 
535 


Val 


Phe 


Met 


Arg 


Glu 
540 


Thr 


His 


Leu 


Asn 


Glu 
545 


Arg 


Ala 


Lys 


Tyr 


Val 
550 


Thr 


Tyr 


Gin 


Tyr 


Asp 
555 


Leu 


Glu 


Leu 


Asp 


Arg 
560 


Trp 


Ser 






Gin 
565 


His 


He 


Ser 


Glu 


57 0 


Val 








575 


Gly 


Arg 


Asp 


Phe 


Arg 
580 


Cys 


Thr 


Val 


Gly 


Lys 
585 


Leu 


Tyr 


Pro 


Ser 


Cys 
590 


Leu 


Glu 


Glu 


Ser 


Pro 
595 


Trp 


Lys 


Pro 


Pro 


Thr 
600 


Tyr 


Leu 


Phe 


Ser 


Thr 
605 


Asp 


Gly 


Thr 


Glu 


Glu 
610 


Phe 


Glu 


Leu 


Asp 


Gly 
615 


Glu 


Met 


Val 


Ala 


Leu 
620 


Pro 


Pro 


Val 





Table 3 



hBAZF nucleotide sequence (SEQ ID NO:3) 



caagggagcg 


agggtgtcgt 


agagggcaga 


atgaacaaga 


agaattagga 


gggaggctgc 


60 


gtgtgccggg 


gctaggggct 


ggaagtcctg 


gctctagttg 


cacctcggaa 


ggaaaaggca 


120 


aacagaggag 


ggaaggcgtc 


ttaggactgc 


ctggatccag 


agcactttcc 


tcggcctcta 


180 


caggcctgtg 


tcgctatggg 


ttcccccgcc 


gccccggagg 


gagcgctggg 


ctacgtccgc 


240 


gagttcactc 


gccactcctc 


cgacgtgctg 


ggcaacctca 


acgagctgcg 


cctgcgcggg 


300 


atcctcactg 


acgtcacgct 


gctggttggc 


gggcaacccc 


tcagagcaca 


caaggcagtt 


360 


ctcatcgcct 


gcagtggctt 


cttctattca 


attttccggg 


gccgtgcggg 


agtcggggtg 


420 


gacgtgctct 


ctctgcccgg 


gggtcccgaa 


gcgagaggct 


tcgcccctct 


attggacttc 


480 


atgtacactt 


cgcgcctgcg 


cctctctcca 


gccactgcac 


cagcagtcct 


agcggccgcc 


540 


acctatttgc 


agatggagca 


cgtggtccag 


gcatgccacc 


gcttcatcca 


ggccagctat 


600 


gaacctctgg 


gcatctccct 


gcgccccctg 


gaagcagaac 


ccccaacacc 


cccaacggcc 


660 


cctccaccag 


gtagtcccag 


gcgctccgaa 


ggacacccag 


acccacctac 


tgaatctcga 


720 


agctgcagtc 


aaggcccccc 


cagtccagcc 


agccctgacc 


ccaaggcctg 


caactggaaa 


780 


aagtacaagt 


acatcgtgct 


aaactctcag 


gcctcccaag 


cagggagcct 


ggtcggggag 


840 


agaagttctg 


gtcaaccttg 


cccccaagcc 


aggctcccca 


gtggagacga 


ggcctccagc 


900 


agcagcagca 


gcagcagcag 


cagcagcagt 


gaagaaggac 


ccattcctgg 


tccccagagc 


960 


aggctctctc 


caactgctgc 


cactgtgcag 


ttcaaatgtg 


gggctccagc 


cagtaccccc 


1020 


tacctcctca 


catcccaggc 


tcaagacacc 


tctggatcac 


cctctgaacg 


ggctcgtcca 


1080 


ctacccggga 


gtgaattttt 


cagctgccag 


aactgtgagg 


ctgtggcagg 


gtgctcatcg 


1140 


gggctggact 


ccttggttcc 


tggggacgaa 


gacaaaccct 


ataagtgtca 


gctgtgccgg 




tcttcgttcc 


gctacaaggg 


caaccttgcc 


agtcaccgta 


cagtgcacac 


aggggaaaag 


1260 


ccttaccact 


gctcaatctg 


cggagcccgt 


tttaaccggc 


cagcaaacct 


gaaaacgcac 


1320 


agccgcatcc 


attcgggaga 


gaagccgtat 


aagtgtgaga 


cgtgcggctc 


gcgctttgta 


1380 


caggtacgga 


gccagcctcc 


aagtggcttc 


caaggcaaac 


ctgcaagagg 


tggggtgggc 


1440 


caaaagggag 


ggttctgttc 


ctcccagagg 


caggacttga 


agtctcctcc 


ctcccaggtg 


1500 


gcacatctgc 


gggcgcacgt 


gctgatccac 


accggggaga 


agccctaccc 


ttgccctacc 


1560 


tgcggaaccc 


gcttccgcca 


cctgcagacc 


ctcaagagcc 


acgttcgcat 


ccacaccgga 


1620 


gagaagcctt 


accactgcga 


cccctgtggc 


ctgcatttcc 


ggcacaagag 


tcaactgcgg 


1680 


ctgcatctgc 


gccagaaaca 


cggagctgct 


accaacacca 


aagtgcacta 


ccacattctc 


1740 


ggggggccc 












1749 



Table 4 hBAZF polypeptide sequence (SEQ ID NO:4) 



Met 


Gly 


Ser 


Pro 


Ala 


Ala 


Pro 


Glu 


Gly 


Ala 


Leu 


Gly 


Tyr Val Arg 


Glu 


1 








5 










10 






15 




Phe 


Thr 


Arg 


His 


Ser 


Ser 


Asp 


Val 


Leu 


Gly 


Asn 


Leu 


Asn Glu Leu 


Arg 








20 










25 








30 




Leu 


Arg 


Gly 


He 


Leu 


Thr 


Asp 


Val 


Thr 


Leu 


Leu 


Val 


Gly Gly Gin 


Pro 






35 










40 










45 
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Arg 
50 


Ala 


His 


Lys 


Ala 


Val 
55 


Leu 


lie 


Ala 


Cys 


Ser 
60 


Gly 


Phe 


Phe 


Tyr 


Ser 
65 


lie 


Phe 


Arg 


Gly 


Arg 
70 


Ala 


Gly 


Val 


Gly 


Val 
75 


Asp 


Val 


Leu 


Ser 


Leu 
80 


Pro 


Gly 


Gly 


Pro 


Glu 
85 


Ala 


Arg 


Gly 


Phe 


Ala 
90 


Pro 


Leu 


Leu 


Asp 


Phe 
95 


Met 


Tyr 


Thr 


Ser 


Arg 
100 


Leu 


Arg 


Leu 


Ser 


Pro 
105 


Ala 


Thr 


Ala 


Pro 


Ala 
110 


Val 


Leu 


Ala 


Ala 


Ala 
115 


Thr 


Tyr 


Leu 


Gin 


Met 
120 


Glu 


His 


Val 


Val 


Gin 
125 


Ala 


Cys 


His 


Arg 


Phe 
130 


lie 


Gin 


Ala 


Ser 


Tyr 
135 


Glu 


Pro 


Leu 


Gly 


He 
140 


Ser 


Leu 


Arg 


Pro 


Leu 
145 


Glu 


Ala 


Glu 


Pro 


Pro 
150 


Thr 


Pro 


Pro 


Thr 


Ala 
155 


Pro 


Pro 


Pro 


Gly 


Ser 
160 


Pro 


Arg 


Arg 


Ser 


Glu 
165 


Gly 


His 


Pro 


Asp 


Pro 
170 


Pro 


Thr 


Glu 


Ser 


Arg 
175 


Ser 


Cys 


Ser 


Gin 


Gly 
180 


Pro 


Pro 


Ser 


Pro 


Ala 
185 


Ser 


Pro 


Asp 


Pro 


Lys 
190 


Ala 


Cys 




Trp 


195 


Lys 


Tyr 


Lys 


Tyr 


He 
200 


Val 


Leu 


Asn 


Ser 


Gin 
205 


Ala 


Ser 


Gin 


Ala 


Gly 
210 


Ser 


Leu 


Val 


Gly 


Glu 

215 


Arg 


Ser 


Ser 


Gly 


Gin 
220 


Pro 


Cys 


Pro 


Gin 


Ala 
225 


Arg 




Pro 


Ser 


Gly 
230 


Asp 


Glu 


Ala 


Ser 


Ser 
235 


Ser 


Ser 


Ser 


Ser 


Ser 
240 


Ser 


Ser 


Ser 


Ser 


Ser 
245 


Glu 


Glu 


Gly 


Pro 


He 
250 


Pro 


Gly 


Pro 


Gin 


Ser 
255 


Arg 


Leu 


Ser 


Pro 


Thr 
260 


Ala 


Ala 


Thr 


Val 


Gin 
265 


Phe 


Lys 


Cys 


Gly 


Ala 
270 


Pro 


Ala 


Ser 


Thr 


Pro 
275 


Tyr 


Leu 


Leu 


Thr 


Ser 
280 


Gin 


Ala 


Gin 


Asp 


Thr 
285 


Ser 


Gly 


Ser 


Pro 


Ser 
290 


Glu 


Arg 


Ala 


Arg 


Pro 
295 


Leu 


Pro 


Gly 


Ser 


Glu 
300 


Phe 


Phe 


Ser 


Cys 


Gin 
305 


Asn 


Cys 


Glu 


Ala 


Val 
310 


Ala 


Gly 


Cys 


Ser 


Ser 
315 


Gly 


Leu 


Asp 


Ser 


Leu 
320 


Val 


Pro 


Gly 


Asp 


Glu 
325 


Asp 


Lys 


Pro 


Tyr 


Lys 
330 


Cys 


Gin 




Cys 


335 


Ser 


Ser 


Phe 


Arg 


Tyr 
340 


Lys 


Gly 


Asn 


Leu 


Ala 
345 


Ser 


His 


Arg 


Thr 


Val 
350 


His 


Thr 


Gly Glu 


Lys 
355 


Pro 


Tyr 


His 


Cys 


Ser 
360 


lie 


Cys 


Gly 


Ala 


Arg 
365 


Phe 


Asn 


Arg 


Pro 


Ala 
370 


Asn 


Leu 


Lys 


Thr 


His 
375 


Ser 


Arg 


He 


His 


Ser 
380 


Gly 


Glu 


Lys 


Pro 


Tyr 
385 


Lys 


Cys 


Glu 


Thr 


Cys 
390 


Gly 


Ser 


Arg 


Phe 


Val 
395 


Gin 


Val 


Arg 


Ser 


Gin 
400 
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Ser 




Phe 
405 




Gly 


Lys 




410 




Gly 






Gly 
415 




Lys 


Gly 


Gly 


Phe 
420 


Cys 


Ser 


Ser 


Gin 


Arg 
425 


Gin 


Asp 


Leu 


Lys 


Ser 
430 


Pro 


Pro 


Ser 


Gin 


Val 
435 


Ala 


His 


Leu 


Arg 


Ala 
440 


His 


Val 


Leu 


He 


His 
445 


Thr 


Gly 


Glu 


Lys 


Pro 
450 


Tyr 


Pro 


Cys 


Pro 


Thr 
455 


Cys 


Gly 


Thr 


Arg 


Phe 
460 


Arg 


His 


Leu 


Gin 


Thr 
465 




Lys 


Ser 


His 


Val 
470 


Arg 


He 


His 


Thr 


Gly 
475 


Glu 


Lys 


Pro 


Tyr 


His 
480 


Cys 


Asp 


Pro 


Cys 


Gly 
485 


Leu 


His 


Phe 


Arg 


His 
490 


Lys 


Ser 


Gin 


Leu 


Arg 
495 


Leu 


His 


Leu 


Arg 


Gin 
500 


Lys 


His 


Gly 


Ala 


Ala 
505 


Thr 


Asn 


Thr 


Lys 


Val 
510 


His 


Tyr 


His 


He 


Leu 
515 


Gly 


Gly 


Pro 























Table 5 hEF-G nucleotide sequence (SEQ ID NO: 5) 



tctttttcct 


cgcgtccttt 


gccccggaag 


tgctcttaca 


acattggctg 


ccggcgtgac 


60 


tttgaccgct 


tcccggtgcg 


ttaccggcag 


ctgaacccac 


ccggcgccac 


gggactttga 


120 


cgcgtgctct 


gcgcttgcca 


tgagactcct 


gggagctgca 


gccgtcgcgg 


ctctggggcg 


180 


cggaagggcc 


cccgcctccc 


taggctggca 


gaggaagcag 


gttaattgga 


aggcctgccg 


240 


atggtcttca 


tcaggggtga 


ttcctaatga 


aaaaatacga 


aatattggaa 


tctcagctca 


300 


cattgattct 


gggaaaacta 


cattaacaga 


acgagtcctt 


tactacactg 


gcagaattgc 


360 


aaagatgcat 


gaggtgaaag 


gtaaagatgg 


agttggtgct 


gtcatggatt 


ccatggaact 


420 


agagagacaa 


agaggaatca 


ctattcagtc 


agcagccact 


ttcaccatgt 


ggaaagatgt 


480 


caatattaac 


attatagata 


ctcctgggca 


tgtggacttc 


acaatagaag 


tggaaagggc 


540 


cctgagagtg 


ttggatggtg 


cagtccttgt 


tctctgtgct 


gttggagggg 


tacagtgcca 


600 


gaccatgact 


gtcaatcgtc 


agatgaagcg 


ctacaacgtt 


ccgtttctaa 


cttttattaa 


660 


caaattggac 


cgaatgggct 


ccaacccagc 


cagggccctg 


cagcaaatga 


ggtctaaact 


720 


aaatcataat 


acagcgttta 


tgcagatacc 


catgggtttg 


gagggtaatt 


ttaaaggtat 


780 


tgtagatctt 


attgaggaac 


gagccatcta 


ttttgatgga 


gactttagtc 


agattgttcg 


840 


atatggtgag 


attccagctg 


aattaagggc 


ggcggccact 


gaccaccggc 


aggagctaat 


900 


tgaatgtgtt 


gccaattcag 


atgaacagct 


tggtgagatg 


tttctggaag 


aaaaaatccc 


960 


ctcgatttct 


gatttaaagc 


tagcaattcg 


aagagctact 


ctgaaaagat 


catttactcc 


1020 


tgtatttttg 


ggaagcgcct 


tgaagaacaa 


aggagttcag 


cctcttttag 


atgctgtttt 


1080 


agaatacctc 


ccaaatccat 


ctgaagtcca 


gaactatgct 


attctcaata 


aaaaggatga 


1140 


ctcaaaagag 


aaaaccaaaa 


tcctaatgaa 


ctccagtaga 


cacaattccc 


acccatttgt 


1200 


aggcctggct 


tttcccctgg 


aggtaggtcg 


atttggacaa 


ttaacttatg 


ttcgcagtta 


1260 


tcagggagag 


ctaaagaagg 


gtgacaccat 


ctataacaca 


aggacaagaa 


agaaagtacg 


1320 


gttgcaacgg 


ctggctcgca 


tgcatgccga 


catgatggag 


gcaagtacag 


aggaagtata 


1380 


tgccggagac 


atctgtgcat 


tgtttggcat 


tgactgtgct 


agtggagaca 


cattcacaga 


1440 
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Cci3.ciCjCC3.ciC 


agcggccfc tt 




aattcatgtt 




tcatttcaat 


1500 


agcaatgaag 


cctfcctaaca 


agaacgatct 


ggaaaaattt 


tcaaaaggta 


ttggcaggtt 


1560 


tacaagagaa 


gatcccacat 


ttaaagtata 


ctttgacact 


gagaacaaag 


agacagttat 


1620 


atctggaatg 


ggagaattac 


acctggaaat 


ctatgctcag 


aggctggaaa 


gagagtatgg 


1680 


ctgtcctfcgt 


atcacaggaa 








ctgcccctgt 


1740 






aalaacaatc 


agg^gtgca 
ctLactaaa 


ggccagtatg 


gaaaag aat 




Iggtgtcctg 


gagcctctgj 






ttggaatttt 


cagatgaaac 


1860 


Ittcggatca 


aatattccaa 


agcagtttgt 


^ct^t^ta 






1920 


ctg cg a gaag 


ccctcttt 


c tggtcacaa. 


^tctct 3 3 


c ccgg g 


tcc^casTa 
cc gcaaga 


1980 


tggagcacac 


cacatggttg 


attc taatga 


aLctctttc 


^cctatta 


gagaaggtgc 


2040 


tcttaaacaa 


gcctt^caa 

ccLat/^aat 


atgcaacatt 


a tgtat tc t t 


gaacc a 


tggctgtgga 


2100 


a g 1 1 g t a. g c t 




cagggaca 




ggaa taacc 


gaegecatgg 


2160 


ggtaatcact 


gggcaagatg 




cta.tttta.ca. 


ctgtatgcag 


atgtccctct 


2220 




tttggttatt 


ccac/aact 
ccactgaact 




acagagggaa 


agggagaat a 


2280 


cacaatggag 


tatagcaggt 


atcagccatg 


tttaccatcc 


acacaagaag 


aegtcat taa 


3 0 


1 a a g 1 a. 1 1 1 g 


gaagctacag 


gtcaacttcc 


tgt taaaaaa 


ggaaaagcca. 


acfaactaac t 


2400 




tgagt tgac t 


gac c aatt 


gaatc tgcgt 


ggt tttgata 


ctttgatgga 


2460 


ttccagtgga 


ataaat tcag 


gctgc tgsaa 


caagaaattc 


tgagcccagg 


aagegggetc 


2520 


ttctttcttc 






a tat tcagga 


get tc tgt ta 


tat tcaaagg 


2580 


t aattc tatg 


tctatctcaa 


ctctattgat 


tggt t ttata 


gt teat tgaa 


aat cc t caaa 


2640 




t tat tact ga 


aata t gt t ta 




ggaaaagaga 


ctaatttcag 


700 


t tatact t t t 


aagcttagaa 


tgtatgttca 


tttccaaatt 


ttgtatcata 


agagttttca 


2760 




aagctgaaaa 




ataaccacat 


actttccatc 


taccttcctt 


2820 


tgttaacggg 


a^tctcataa 


tataataatt 


tgttttgtca 


tat t tgc t t t 


cacfcgfcctat 




tatctgttta 




ctctattttt 


agtttgctga 


agacttgaaa 


gtgaatcgea 


2940 




cacttctt^ 




<3- 1 Q Q Q c a g g c 


tfctfccfcgtfcg 


aagagtggat 




tccgtatgtt 


cttcatagag 


agtgtttttc 










ccaaatttcn 


ctctgtttta 


tatatgncag 


tttatttcag 


tttgtggttt 


ctgcaaattt 


3120 


gtaactgcct 


ctgttttagg 


agtataagta 


ttacttcctt 


gtggtctatt 


gtgaagtaaa 


3180 


aagtagaccc 


ttgcatatac 


tattcttgtt 


tgtgttcatc 


ttaatgtttt 


tgtacagcta 


3240 


aatcaaatgt 


aatttataga 


gttagtttca 


tcaacctaat 


gaatgctagt 


taaatttgaa 


3300 


ttccttggaa 


tttatcgtat 


attgtattca 


ctgagattat 


gaagggacaa 


atgttaatct 


3360 


tttgtttcca 


gaaaaagttg 


ggctttccca 


agcagttcta 


ttacccggtt 


cagaattget 


3420 


tcatccaaaa 


atcatctgat 


ggtatagatg 


gatcctagtc 


cttttcatta 


cctgatggta 


3480 


gaaataaaat 


aattgatttt 


a 








3501 



Table 6 hEF-G polypeptide (SEQ ID NO:6) 



Met 


Arg 


Leu 


Leu 


Gly 


Ala 


Ala Ala 


Val 


Ala 


Ala 


Leu 


Gly Arg 


Gly Arg 


1 








5 








10 








15 


Ala 


Pro 


Ala 


Ser 


Leu 


Gly 


Trp Gin 


Arg 


Lys 


Gin 


Val 


Asn Trp 


Lys Ala 








20 








25 








30 




Cys 


Arg 


Trp 


Ser 


Ser 


Ser 


Gly Val 


He 


Pro 




Glu 




Arg Asn 






35 








40 










45 
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He 


Gly 
50 


He 


Ser 


Ala 


His 


He 
55 


Asp 


Ser 


Gly 


Lys 


Thr 
60 


Thr 


Leu 


Thr 


Glu 


Arg 
65 


Val 


Leu 


Tyr 


Tyr 


Thr 
70 


Gly 


Arg 


He 


Ala 


Lys 
75 


Met 


His 


Glu 


Val 


Lys 
80 


Gly 


Lys 


Asp 


Gly 


Val 
85 


Gly 


Ala 


Val 


Met 


Asp 
90 


Ser 


Met 


Glu 


Leu 


Glu 
95 


Arg 


Gin 


Arg 


Gly 


He 
100 


Thr 


He 


Gin 


Ser 


Ala 
105 


Ala 


Thr 


Phe 


Thr 


Met 
110 


Trp 


Lys 


Asp 


Val 


Asn 
115 


He 


Asn 


He 


He 


Asp 
120 


Thr 


Pro 


Gly 


His 


Val 
125 


Asp 


Phe 


Thr 


He 


Glu 
130 


Val 


Glu 


Arg 


Ala 


Leu 
135 


Arg 


Val 


Leu 


Asp 


Gly 
140 


Ala 


Val 


Leu 


Val 


Leu 
145 


Cys 


Ala 


Val 


Gly 


Gly 
150 


Val 


Gin 


Cys 


Gin 


Thr 
155 


Met 


Thr 


Val 


Asn 


Arg 
160 


Gin 


Met 


Lys 


Arg 


Tyr 
165 


Asn 


Val 


Pro 


Phe 


Leu 
170 


Thr 


Phe 


lie 


Asn 


Lys 
175 


Leu 


Asp 


Arg 


Met 


Gly 
180 


Ser 


Asn 


Pro 


Ala 


Arg 
185 


Ala 


Leu 


Gin 


Gin 


Met 
190 


Arg 


Ser 


Lys 


Leu 


Asn 
195 


His 


Asn 


Thr 


Ala 


Phe 
200 


Met 


Gin 


He 


Pro 


Met 
205 


Gly 




Glu 


Gly Asn 
210 


Phe 


Lys 


Gly 


He 


Val 
215 


Asp 


Leu 


He 


Glu 


Glu 
220 


Arg 


Ala 


He 


Tyr 


Phe 
225 


Asp 


Gly 


Asp 


Phe 


Ser 
230 


Gin 


He 


Val 


Arg 


Tyr 
235 


Gly 


Glu 


He 


Pro 


Ala 
240 


Glu 


Leu 


Arg 


Ala 


Ala 
245 


Ala 


Thr 


Asp 


His 


Arg 
250 


Gin 


Glu 


Leu 


He 


Glu 
255 


Cys 


Val 


Ala 


Asn 


Ser 
260 


Asp 


Glu 


Gin 


Leu 


Gly 
265 


Glu 


Met 


Phe 


Leu 


Glu 
270 


Glu 


Lys 


He 


Pro 


Ser 
275 


He 


Ser 


Asp 




Lys 
280 


Leu 


Ala 


He 


Arg 


Arg 
285 


Ala 


Thr 


Leu 




290 


Ser 


Phe 


Thr 


Pro 


Val 
295 


Phe 


Leu 


Gly 


Ser 


Ala 
300 


Leu 


Lys 


Asn 


Lys 


Gly Val 
305 


Gin 


Pro 


Leu 


Leu 
310 


Asp 


Ala 


Val 


Leu 


Glu 
315 


Tyr 


Leu 


Pro 


Asn 


Pro 
320 


Ser 


Glu 


Val 


Gin 


Asn 
325 


Tyr 


Ala 


He 




330 


Lys 


Lys 


Asp 


Asp 


Ser 
335 


Lys 


Glu 


Lys 


Thr 


Lys 
340 


He 




Met 


Asn 


Ser 
345 


Ser 


Arg 


His 


Asn 


Ser 
350 


His 


Pro 


Phe 


Val 


Gly 
355 


Leu 


Ala 


Phe 


Pro 


Leu 
360 


Glu 


Val 


Gly 


Arg 


Phe 
365 


Gly 


Gin 


Leu 


Thr 


Tyr 
370 


Val 


Arg 


Ser 


Tyr 


Gin 
375 


Gly 


Glu 


Leu 


Lys 


Lys 
380 


Gly 


Asp 


Thr 


He 


Tyr 
385 


Asn 


Thr 


Arg 


Thr 


Arg 
390 


Lys 


Lys 


Val 


Arg 


Leu 
395 


Gin 


Arg 


Leu 


Ala 


Arg 
400 
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Met 


His 


Ala 


Asp 


Met 
405 


Met 


Glu 


Ala 


Ser 


Thr 
410 


Glu 


Glu 


Val 


Tyr 


Ala Gly 
415 


Asp 


He 


Cys 


Ala 
420 


Leu 


Phe 


Gly 


He 


Asp 
425 


Cys 


Ala 


Ser 


Gly 


Asp 
430 


Thr 


Phe 


Thr 


Asp 


Lys 
435 


Ala 


Asn 


Ser 


Gly 


Leu 
440 


Ser 


Met 


Glu 


Ser 


He 
445 


His 


Val 


Pro 


Asp 


Pro 
450 


Val 


He 


Ser 


lie 


Ala 
455 


Met 


Lys 


Pro 


Ser 


Asn 
460 


Lys 


Asn 


Asp 


Leu 


Glu 
465 


Lys 


Phe 


Ser 


Lys 


Gly 
470 


He 


Gly 


Arg 


Phe 


Thr 
475 


Arg 


Glu 


Asp 


Pro 


Thr 
480 


Phe 


Lys 


Val 


Tyr 


Phe 
485 


Asp 


Thr 


Glu 


Asn 


Lys 
490 


Glu 


Thr 


Val 


He 


Ser 
495 


Gly 


Met 


Gly 


Glu 


Leu 
500 


His 


Leu 


Glu 


He 


Tyr 
505 


Ala 


Gin 


Arg 


Leu 


Glu 
510 


Arg 


Glu 


Tyr 


Gly 


Cys 
515 


Pro 


Cys 


He 


Thr 


Gly 
520 


Lys 


Pro 


Lys 


Val 


Ala 
525 


Phe 


Arg 


Glu 


Thr 


He 
530 


Thr 


Ala 


Pro 


Val 


Pro 
535 


Phe 


Asp 


Phe 


Thr 


His 
540 


Lys 


Lys 


Gin 


Ser 


Gly 
545 


Gly 


Ala 


Gly 


Gin 


Tyr 
550 


Gly 


Lys 


Val 


He 


Gly 
555 


Val 


Leu 


Glu 


Pro 


Leu 
560 


Asp 


Pro 


Glu 


Asp 


Tyr 
565 


Thr 


Lys 


Leu 


Glu 


Phe 
570 


Ser 


Asp 


Glu 


Thr 


Phe 
575 


Gly 


Ser 


Asn 


He 


Pro 
580 


Lys 


Gin 


Phe 


Val 


Pro 
585 


Ala 


Val 


Glu 


Lys 


Gly 
590 


Phe 


Leu 


Asp 


Ala 


Cys 
595 


Glu 


Lys 


Gly 


Pro 


Leu 
600 


Ser 


Gly 


His 


Lys 


Leu 
605 


Ser 


Gly 


Leu 


Arg 


Phe 
610 


Val 


Leu 


Gin 


Asp 


Gly 
615 


Ala 


His 


His 


Met 


Val 
620 


Asp 


Ser 


Asn 


Glu 


He 
625 


Ser 


Phe 


He 


Arg 


Ala 
630 


Gly 


Glu 


Gly 


Ala 


Leu 
635 


Lys 


Gin 


Ala 


Leu 


Ala 
640 


Asn 


Ala 


Thr 


Leu 


Cys 
645 


He 


Leu 


Glu 


Pro 


He 
650 


Met 


Ala 


Val 


Glu 


Val 
655 


Val 


Ala 


Pro 


Asn 


Glu 
660 


Phe 


Gin 


Gly 


Gin 


Val 
665 


He 


Ala 


Gly 


He 


Asn 
670 


Arg 


Arg 


His 


Gly 


Val 
675 


He 


Thr 


Gly 


Gin 


Asp 
680 


Gly 


Val 


Glu 


Asp 


Tyr 
685 


Phe 


Thr 


Leu 


Tyr 


Ala 
690 


Asp 


Val 


Pro 


Leu 


Asn 
695 


Asp 


Met 


Phe 


Gly 


Tyr 
700 


Ser 


Thr 


Glu 


Leu 


Arg 
705 


Ser 


Cys 


Thr 


Glu 


Gly 
710 


Lys 


Gly 


Glu 


Tyr 


Thr 
715 


Met 


Glu 


Tyr 


Ser 


Arg 
720 


Tyr 


Gin 


Pro 


Cys 


Leu 
725 


Pro 


Ser 


Thr 


Gin 


Glu 
730 


Asp 


Val 


He 


Asn 


Lys 
735 


Tyr 


Leu 


Glu 


Ala 


Thr 
740 


Gly 


Gin 


Leu 


Pro 


Val 
745 


Lys 


Lys 


Gly 


Lys 


Ala 
750 


Lys 


Asn 



Table 7 hTRG nucleotide sequence (SEQ ID NO:7) 



gccgcgggag 


caggcggagg 


cggaggcggc 


gggggcagga 


ggatgtcgca 


gccgccgctg 


60 


ctccccgcct 


cggcggagac 


tcggaagttc 


acccgggcgc 


tgagtaagcc 


gggcacggcg 


120 


gccgagctgc 


ggcagagcgt 


gtctgaggtg 


gtgcgcggct 


ccgtgctcct 


ggcaaagcca 


180 


aagctaattg 


agccactcga 


ctatgaaaat 


gtcatcgtcc 


agaagaagac 


tcagatcctg 


240 


aacgactgtt 


tacgggagat 


gctgctcttc 


ccttacgatg 


actttcagac 


ggccatcctg 


300 


agacgacagg 


gtcgatacat 


atgctcaaca 


gtgcctgcga 


aggcggaaga 


ggaagcacag 


360 


agcttgtttg 


ttacagagtg 


catcaaaacc 


tataactctg 


actggcatct 


tgtgaactat 


420 


aaatatgaag 


attactcagg 


agagtttcga 


cagcttccga 


acaaagtggt 


caagttggat 


480 


aaacttccag 


ttcatgtcta 


tgaagttgac 


gaggaggtcg 


acaaagatga 


ggatgctgcc 


540 


tcccttggtt 


cccagaaagg 


tgggatcacc 


aagcatggct 


ggctgtacaa 


aggcaacatg 


600 


aacagtgcca 


tcagcgtgac 


catgaggtca 


tttaagagac 


gatttttcca 


cctgattcaa 


660 


cttggcgatg 


gatcctataa 


atttgaattt 


ttaaaagatc 


tccaaaagga 


accaaaagga 


720 


tcaatatttc 


tgggattcct 


gtatggggtg 


tcgttcagga 


acaacaaagt 


caggcgtttt 


780 


gcttttgagc 


tcaagatgca 


ggacaaaagt 


agttatctct 


tggcagcaga 


cagtgaagtg 


840 


gaaatggaag 


aatggatcac 


aattctaaat 


aagatcctcc 


agctcaactt 


tgaagctgca 


900 


atgcaagaaa 


agcgaaatgg 


cgactctcac 


gaagatgatg 


aacaaagcaa 


attggaaggt 


960 


tctggttccg 


gtttagatag 


ctacctgccg 


gaacttgcca 


agagtgcaag 


agaagcagaa 


1020 


atcaaactga 


aaagtgaaag 


cagagtcaaa 


cttttttatt 


tggacccaga 


tgcccagaag 


1080 


cttgacttct 


catcagctga 


gccagaagtg 


aagtcatttg 


aagagaagtt 


tggaaaaagg 


1140 


atccttgtca 


agtgcaatga 


tttatctttc 


aatttgcaat 


gctgtgttgc 


cgaaaatgaa 


1200 


gaaggaccca 


ctacaaatgt 


tgaacctttc 


tttgttactc 


tatccctgtt 


tgacataaaa 


1260 


tacaaccgga 


agatttctgc 


cgatttccac 


gtagacctga 


accatttctc 


agtgaggcaa 


1320 


atgatcgcca 


ccacgtcccc 


ggcgctgatg 


aatggcagtg 


ggccggaaac 


ccaatctgcc 


1380 


ctcaggggca 


tccttcatga 


agccgccatg 


cagtatccga 


agcagggaat 


attttcagtc 


1440 


acttgtcctc 


atccagatat 


atttcttgtg 


gccagaattg 


aaaaagtcct 


tcaggggagc 


1500 


atcacacatt 


gcgctgagcc 


atatatgaaa 


agttcagact 


cttctaaggt 


ggcccagaag 


1560 


gtgctgaaga 


atgccaagca 


ggcatgccaa 


agactaggac 


agtatagaat 


gccatttgct 


1620 


tgggcagcaa 


ggacattgtt 


taaggatgca 


tctggaaatc 


ttgacaaaaa 


tgccagattt 


1680 


tctgccatct 


acaggcaaga 


cagcaataag 


ctatccaatg 


atgacatgct 


caagttactt 


1740 


gcagactttc 


ggaaacctga 


gaagatggct 


aagctcccag 


tgattttagg 


caatctagac 


1800 


attacaattg 


ataatgtttc 


ctcagacttc 


cctaattatg 


ttaattcatc 


atacattccc 


1860 


acaaaacaat 


ttgaaacctg 


cagtaaaact 


cccatcacgt 


ttgaagtgga 


ggaatttgtg 


1920 


ccctgcatac 


caaaacacac 


tcagccttac 


accatctaca 


ccaatcacct 


ttacgtttat 


1980 


cctaagtact 


tgaaatacga 


cagtcagaag 


tcttttgcca 


aggctagaaa 


tattgcgatt 


2040 


tgcattgaat 


tcaaagattc 


agatgaggaa 


gactctcagc 


cccttaagtg 


catttatggc 


2100 


agacctggtg 


ggccagtttt 


cacaagaagc 


gcctttgctg 


cagttttaca 


ccatcaccaa 


2160 


aacccagaat 


tttatgatga 


gattaaaata 


gagttgccca 


ctcagctgca 


tgaaaagcac 


2220 


cacctgttgc 


tcacattctt 


ccatgtcagc 


tgtgacaact 


caagtaaagg 


aagcacgaag 


2280 


aagagggatg 


tcgttgaaac 


ccaagttggc 


tactcctggc 


ttcccctcct 


gaaagacgga 


2340 


agggtggtga 


caagcgagca 


gcacatcccg 


gtctcggcga 


accttccttc 


gggctatctt 


2400 
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ggctaccagg agcttgggat gggcaggcat tatggtccgg aaattaaatg ggtagatgga 2460 

ggcaagccac tgctgaaaat ttccactcat ctggtttcta cagtgtatac tcaggatcag 252 0 

catttacata attttttcca gtactgtcag aaaaccgaat ctggagccca agccttagga 2580 

aacgaacttg taaagtacct taagagtctg catgcgatgg aaggccacgt gatgatcgcc 2640 

ttcttgccca ctatcctaaa ccagctgttc cgagtcctca ccagagccac acaggaagaa 2700 

gtcgcggtta acgtgactcg ggtcattatt catgtggttg cccagtgcca tgaggaagga 2760 

ttggagagcc acttgaggtc atatgttaag tacgcgtata aggctgagcc atatgttgcc 282 0 

tctgaataca agacagtgca tgaagaactg accaaatcca tgaccacgat tctcaagcct 2880 

tctgccgatt tcctcaccag caacaaacta ctgaagtact catggttttt ctttgatgta 2940 

ctgatcaaat ctatggctca gcatttgata gagaactcca aagttaagtt gctgcgaaac 3000 

cagagatttc ctgcatccta tcatcatgca gtggaaaccg ttgtaaatat gctgatgcca 3060 

cacatcactc agaagtttcg agataatcca gaggcatcta agaacgcgaa tcatagcctt 3120 

gctgtcttca tcaagagatg tttcaccttc atggacaggg gctttgtctt caagcagatc 3180 

aacaactaca ttagctgttt tgctcctgga gacccaaaga ccctctttga atacaagttt 3240 

gaatttctcc gtgtagtgtg caaccatgaa cattatattc cgttgaactt accaatgcca 3300 

tttggaaaag gcaggattca aagataccaa gacctccagc ttgactactc attaacagat 3360 

gagttctgca gaaaccactt cttggtggga ctgttactga gggaggtggg gacagccctc 3420 

caggagttcc gggaggtccg tctgatcgcc atcagtgtgc tcaagaacct gctgataaag 3480 

cattcttttg atgacagata tgcttcaagg agccatcagg caaggatagc caccctctac 3540 

ctgcctctgt ttggtctgct gattgaaaac gtccagcgga tcaatgtgag ggatgtgtca 3600 

cccttccctg tgaacgcggg catgactgtg aaggatgaat ccctggctct accagctgtg 3660 

aatccgctgg tgacgccgca gaagggaagc accctggaca acagcctgca caaggacctg 3720 

ctgggcgcca tctccggcat tgcttctcca tatacaacct caactccaaa catcaacagt 3780 

gtgagaaatg ctgattcgag aggatctctc ataagcacag attcgggtaa cagccttcca 3840 

gaaaggaata gtgagaagag caattccctg gataagcacc aacaaagtag cacattggga 3900 

aattccgtgg ttcgctgtga taaacttgac cagtctgaga ttaagagcct actgatgtgt 3960 

ttcctctaca tcttaaagag catgtctgat gatgctttgt ttacatattg gaacaaggct 4020 

tcaacatctg aacttatgga tttttttaca atatctgaag tctgcctgca ccagttccag 4080 

tacatgggga agcgatacat agccagaaca ggaatgatgc atgccagatt gcagcagctg 4140 

ggcagcctgg ataactctct cacttttaac cacagctatg gccactcgga cgcagatgtt 4200 

ctgcaccagt cattacttga agccaacatt gctactgagg tttgcctgac agctctggac 4260 

acgctttctc tatttacatt ggcgtttaag aaccagctcc tggccgacca tggacataat 4320 

cctctcatga aaaaagtttt tgatgtctac ctgtgttttc ttcaaaaaca tcagtctgaa 4380 

acggctttaa aaaatgtctt cactgcctta aggtccttaa tttataagtt tccctcaaca 4440 

ttctatgaag ggagagcgga catgtgtgcg gctctgtgtt acgagattct caagtgctgt 4500 

aactccaagc tgagctccat caggacggag gcctcccagc tgctctactt cctgatgagg 4560 

aacaactttg attacactgg aaagaagtcc tttgtccgga cacatttgca agtcatcata 4620 

tctgtcagcc agctgatagc agacgttgtt ggcattgggg gaaccagatt ccagcagtcc 4680 

ctgtccatca tcaacaactg tgccaacagt gaccggctta ttaagcacac cagcttctcc 4740 

tctgatgtga aggacttaac caaaaggata cgcacggtgc taatggccac cgcccagatg 4800 

aaggagcatg agaacgaccc agagatgctg gtggacctcc agtacagcct ggccaaatcc 4860 

tatgccagca cgcccgagct caggaagacg tggctcgaca gcatggccag gatccatgtc 492 0 

aaaaatggcg atctctcaga ggcagcaatg tgctatgtcc acgtaacagc cctagtggca 4980 

gaatatctca cacggaaaga agcagtccag tgggagccgc cccttctccc ccacagccat 5040 
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agcgcctgcc tgaggaggag ccggggaggc gtgtttagac aaggatgcac cgccttcagg 510 0 

gtcattaccc caaacatcga cgaggaggcc tccatgatgg aagacgtggg gatgcaggat 5160 

gtccatttca acgaggatgt gctgatggag ctccttgagc agtgcgcaga tggactctgg 5220 

aaagccgagc gctacgagct cattgccgac atctacaaac ttatcatccc catttatgag 5280 

aagcggaggg attttgagag gctggcccat ctgtatgaca cgctgcaccg ggcctacagc 5340 

aaagtgaccg aggtcatgca ctcgggccgc aggcttctgg ggacctactt ccgggtagcc 5400 

ttcttcgggc aggcagcgca ataccagttt acagacagtg aaacagatgt ggagggattc 5460 

tttgaagatg aagatggaaa ggagtatatt tacaaggaac ccaaactcac accgctgtcg 552 0 

gaaatttctc agagactcct taaactgtac tcggataaat ttggttctga aaatgtcaaa 5580 

atgatacagg attctggcaa ggtcaaccct aaggatctgg attctaagta tgcctacatc 5640 

caggtgactc acgtcatccc cttctttgac gaaaaagagt tgcaagaaag gaaaacagag 570 0 

tttgagagat cccacaacat ccgccgcttc atgtttgaga tgccatttac gcagaccggg 5760 

aagaggcagg gcggggtgga agagcagtgc aaacggcgca ccatcctgac agccatacac 582 0 

tgcttccctt atgtgaagaa gcgcatccct gtcatgtacc agcaccacac tgacctgaac 5880 

cccatcgagg tggccattga cgagatgagt aagaaggtgg cggagctccg gcagctgtgc 5940 

tcctcggccg aggtggacat gatcaaactg cagctcaaac tccagggcag cgtgagtgtt 6000 

caggtcaatg ctggcccact agcatatgcg cgagctttct tagatgatac aaacacaaag 606 0 

cgatatcctg acaataaagt gaagctgctt aaggaagttt tcaggcaatt tgtggaagct 612 0 

tgcggtcaag ccttagcggt aaacgaacgt ctgattaaag aagaccagct cgagtatcag 6180 

gaagaaatga aagccaacta cagggaaatg gcgaaggagc tttctgaaat catgcatgag 6240 

cagatctgcc ccctggagga gaagacgagc gtcttaccga attcccttca catcttcaac 6300 

gccatcagtg ggactccaac aagcacaatg gttcacggga tgaccagctc gtcttcggtc 6360 

gt qtga ttac atctcatggc ccgtgtgtgg ggacttgctt tgtcatttgc aaactcagga 6420 

tgctttccaa agccaatcac tggggagacc gagcacaggg aggaccaagg ggaaggggag 6480 

agaaaggaaa taaagaacaa cgttatttct taacagactt tctataggag ttgtaagaag 6540 

gtgcacatat ttttttaaat ctcactggca atattcaaag ttttcattgt gtcttaacaa 6600 

aggtgtggta gacactcttg agctggactt agattttatt cttccttgca gagtagtgtt 6660 

agaatagatg gcctacagaa aaaaaaggtt ctgggatcta catggcaggg agggctgcac 672 0 

tgacattgat gcctggggga ccttttgcct cgaggctgag ctggaaaatc ttgaaaatat 6780 

tttttttttc ctgtggcaca ttcaggttga atacaagaac tatttttgtg actagttttt 6840 

gatgacctaa gggaactgac cattgtaatt tttgtaccag tgaaccagga gatttagtgc 6900 

ttttatattc atttccttgc atttaagaaa atatgaaagc ttaaggaatt atgtgagctt 6960 

aaaactagtc aagcagttta gaaccaaagg cctatattaa taaccgcaac tatgctgaaa 702 0 

agtacaaagt agtacagtat attgttatgt acatatcatt gttaatacag tcctggcatt 7080 

ctgtacatat atgtattaca tttctacatt tttaatactc acatgggctt atgcattaag 7140 

tttaattgtg ataaatttgt gctgttccag tatatgcaat acactttaat gttttattct 7200 

tgtacataaa aatgtgcaat atggagatgt atacagtctt tactatatta ggtttataaa 7260 

cagttttaag aatttcatcc ttttgccaaa atggtggagt atgtaattgg taaatcataa 7320 

atcctgtggt gaatggtggt gtactttaaa gctgtcacca tgttatattt tcttttaaga 7380 

ctttaattta gtaattttat atttgggaaa ataaaggttt ttaattttat ttaactggaa 7440 

tcactgccct gctgtaatta aacattctgt accacatctg tattaaaaag acattgctga 7500 

ccatta 7506 
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Table 8 hTRG polypeptide sequence (SEQ ID NO: 8) 



Met 


Ser 


Gin 


Pro 


Pro 
5 


Leu 


Leu 


Pro 


Ala 


Ser 
10 


Ala 


Glu 


Thr 


Arg 


Lys 
15 


Phe 


Thr 


Arg 


Ala 


Leu 
20 


Ser 


Lys 


Pro 


Gly 


Thr 
25 


Ala 


Ala 


Glu 


Leu 


Arg 
30 


Gin 


Ser 


Val 


Ser 


Glu 
35 


Val 


Val 


Arg 


Gly 


Ser 
40 


Val 


Leu 


Leu 


Ala 


Lys 
45 


Pro 


Lys 


Leu 


He 


Glu 
50 


Pro 


Leu 


Asp 


Tyr 


Glu 
55 


Asn 


Val 


He 


Val 


Gin 
60 


Lys 


Lys 


Thr 


Gin 


He 
65 


Leu 


Asn 


Asp 


Cys 


Leu 
70 


Arg 


Glu 


Met 


Leu 


Leu 
75 


Phe 


Pro 


Tyr 


Asp 


Asp 
80 


Phe 


Gin 


Thr 


Ala 


He 
85 


Leu 


Arg 


Arg 


Gin 


Gly 
90 


Arg 


Tyr 


He 


Cys 


Ser 
95 


Thr 


Val 


Pro 


Ala 


Lys 
100 


Ala 


Glu 


Glu 


Glu 


Ala 
105 


Gin 


Ser 


Leu 


Phe 


Val 
110 


Thr 


Glu 


Cys 


He 


Lys 
115 


Thr 


Tyr 


Asn 


Ser 


Asp 
120 


Trp 


His 


Leu 


Val 


Asn 
125 


Tyr 


Lys 


Tyr 


Glu 


Asp 
130 


Tyr 


Ser 


Gly 


Glu 


Phe 
135 


Arg 


Gin 


Leu 


Pro 


Asn 
140 


Lys 


Val 


Val 


Lys 


Leu 

145 


Asp 


Lys 


Leu 


Pro 


Val 
150 


His 


Val 


Tyr 


Glu 


Val 
155 


Asp 


Glu 


Glu 


Val 


Asp 
160 


Lys 


Asp 


Glu 


Asp 


Ala 
165 


Ala 


Ser 


Leu 


Gly 


Ser 
170 


Gin 


Lys 


Gly 


Gly 


He 
175 


Thr 


Lys 


His 


Gly 


Trp 
180 


Leu 


Tyr 


Lys 


Gly 


Asn 
185 


Met 


Asn 


Ser 


Ala 


He 
190 


Ser 


Val 


Thr 


Met 


Arg 
195 


Ser 


Phe 


Lys 


Arg 


Arg 
200 


Phe 


Phe 


His 


Leu 


He 
205 


Gin 


Leu 


Gly 


Asp 


Gly 
210 


Ser 


Tyr 


Lys 


Phe 


Glu 
215 


Phe 


Leu 


Lys 


Asp 


Leu 
220 


Gin 


Lys 


Glu 


Pro 


Lys 
225 


Gly 


Ser 


He 


Phe 


Leu 
230 


Gly 


Phe 


Leu 


Tyr 


Gly 
235 


Val 


Ser 


Phe 


Arg 


Asn 
240 


Asn 


Lys 


Val 


Arg 


Arg 
245 


Phe 


Ala 


Phe 


Glu 


Leu 
250 


Lys 


Met 


Gin 


Asp 


Lys 
255 


Ser 


Ser 


Tyr 


Leu 


Leu 
260 


Ala 


Ala 


Asp 


Ser 


Glu 
265 


Val 


Glu 


Met 


Glu 


Glu 
270 


Trp 


He 


Thr 


He 


Leu 
275 


Asn 


Lys 


He 


Leu 


Gin 
280 


Leu 


Asn 


Phe 


Glu 


Ala 
285 


Ala 


Met 


Gin 


Glu 


Lys 
290 


Arg 


Asn 


Gly 


Asp 


Ser 
295 


His 


Glu 


Asp 


Asp 


Glu 
300 


Gin 


Ser 


Lys 


Leu 


Glu 
305 


Gly 


Ser 


Gly 


Ser 


Gly 
310 


Leu 


Asp 


Ser 


Tyr 


Leu 
315 


Pro 


Glu 


Leu 


Ala 


Lys 
320 


Ser 


Ala 


Arg 


Glu 


Ala 
325 


Glu 


He 


Lys 


Leu 


Lys 
330 


Ser 


Glu 


Ser 


Arg 


Val 
335 


Lys 


Leu 


Phe 


Tyr 


Leu 


Asp 


Pro 


Asp 


Ala 


Gin 


Lys 


Leu 


Asp 


Phe 


Ser 


Ser 


Ala 
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340 










345 










350 






Glu 


Pro 


Glu 
355 


Val 


Lys 


Ser 


Phe 


Glu 
360 


Glu 


Lys 


Phe 


Gly 


Lys 
365 


Arg 


He 


Leu 


Val 


Lys 
370 


Cys 


Asn 


Asp 


Leu 


Ser 
375 


Phe 


Asn 


Leu 


Gin 


Cys 
380 


Cys 


Val 


Ala 


Glu 


Asn 
385 


Glu 


Glu 


Gly 


Pro 


Thr 
390 


Thr 


Asn 


Val 


Glu 


Pro 
395 


Phe 


Phe 


Val 


Thr 


Leu 
400 


Ser 


Leu 


Phe 


Asp 


He 
405 


Lys 


Tyr 


Asn 


Arg 


Lys 
410 


He 


Ser 


Ala 


Asp 


Phe 
415 


His 


Val 


Asp 


Leu 


Asn 
420 


His 


Phe 


Ser 


Val 


Arg 
425 


Gin 


Met 


He 


Ala 


Thr 
430 


Thr 


Ser 


Pro 


Ala 


Leu 
435 


Met 


Asn 


Gly 


Ser 


Gly 
440 


Pro 


Glu 


Thr 


Gin 


Ser 
445 


Ala 


Leu 


Arg 


Gly 


He 
450 


Leu 


His 


Glu 


Ala 


Ala 
455 


Met 


Gin 


Tyr 


Pro 


Lys 
460 


Gin 


Gly 


He 


Phe 


Ser 
465 


Val 


Thr 


Cys 


Pro 


His 
470 


Pro 


Asp 


He 


Phe 


Leu 
475 


Val 


Ala 


Arg 


He 


Glu 
480 


Lys 


Val 


Leu 


Gin 


Gly 
485 


Ser 


He 


Thr 


His 


Cys 
490 


Ala 


Glu 


Pro 


Tyr 


Met 
495 


Lys 


Ser 


Ser 


Asp 


Ser 
500 


Ser 


Lys 


Val 


Ala 


Gin 
505 


Lys 


Val 


Leu 


Lys 


Asn 
510 


Ala 


Lys 


Gin 


Ala 


Cys 
515 


Gin 


Arg 


Leu 


Gly 


Gin 
520 


Tyr 


Arg 


Met 


Pro 


Phe 
525 


Ala 


Trp 


Ala 


Ala 


Arg 
530 


Thr 


Leu 


Phe 


Lys 


Asp 
535 


Ala 


Ser 


Gly 


Asn 


Leu 
540 


Asp 


Lys 


Asn 


Ala 


Arg 
545 


Phe 


Ser 


Ala 


He 


Tyr 
550 


Arg 


Gin 


Asp 


Ser 


Asn 
555 


Lys 


Leu 


Ser 


Asn 


Asp 
560 


Asp 


Met 


Leu 


Lys 


Leu 
565 


Leu 


Ala 


Asp 


Phe 


Arg 
570 


Lys 


Pro 


Glu 


Lys 


Met 
575 


Ala 


Lys 


Leu 


Pro 


Val 
580 


He 


Leu 


Gly 


Asn 


Leu 
585 


Asp 


He 


Thr 


He 


Asp 
590 


Asn 


Val 


Ser 


Ser 


Asp 
595 


Phe 


Pro 


Asn 


Tyr 


Val 
600 


Asn 


Ser 


Ser 


Tyr 


He 
605 


Pro 


Thr 


Lys 


Gin 


Phe 
610 


Glu 


Thr 


Cys 


Ser 


Lys 
615 


Thr 


Pro 


He 


Thr 


Phe 
620 


Glu 


Val 


Glu 


Glu 


Phe 
625 


Val 


Pro 


Cys 


He 


Pro 
630 


Lys 


His 


Thr 


Gin 


Pro 
635 


Tyr 


Thr 


He 


Tyr 


Thr 
640 


Asn 


His 


Leu 


Tyr 


Val 
645 


Tyr 


Pro 


Lys 


Tyr 


Leu 
650 


Lys 


Tyr 


Asp 


Ser 


Gin 
655 


Lys 


Ser 


Phe 


Ala 


Lys 
660 


Ala 


Arg 


Asn 


He 


Ala 
665 


He 


Cys 


He 


Glu 


Phe 
670 


Lys 


Asp 


Ser 


Asp 


Glu 
675 


Glu 


Asp 


Ser 


Gin 


Pro 
680 


Leu 


Lys 


Cys 


He 


Tyr 
685 


Gly 


Arg 


Pro 


Gly Gly 


Pro 


Val 


Phe 


Thr 


Arg 


Ser 


Ala 


Phe 


Ala 


Ala 


Val 




His 


His 
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690 










695 










700 










His 
705 


Gin 


Asn 


Pro 


Glu 


Phe 
710 


Tyr 


Asp 


Glu 


lie 


Lys 
715 


lie 


Glu 


Leu 


Pro 


Thr 
720 


Gin 


Leu 


His 


Glu 


Lys 
725 


His 


His 


Leu 


Leu 


Leu 
730 


Thr 


Phe 


Phe 


His 


Val 
735 


Ser 


Cys 


Asp 


Asn 


Ser 
740 


Ser 


Lys 


Gly 


Ser 


Thr 
745 


Lys 


Lys 


Arg 


Asp 


Val 
750 


Val 


Glu 


Thr 


Gin 


Val 
755 


Gly 


Tyr 


Ser 


Trp 


Leu 
760 


Pro 


Leu 


Leu 


Lys 


Asp 
765 


Gly 


Arg 


Val 


Val 


Thr 
770 


Ser 


Glu 


Gin 


His 


lie 
775 


Pro 


Val 


Ser 


Ala 


Asn 
780 


Leu 


Pro 


Ser 


Gly 


Tyr 
785 


Leu 


Gly 


Tyr 


Gin 


Glu 
790 


Leu 


Gly 


Met 


Gly 


Arg 
795 


His 


Tyr 


Gly 


Pro 


Glu 
800 


lie 


Lys 


Trp 


Val 


Asp 
805 


Gly 


Gly 


Lys 


Pro 


Leu 
810 


Leu 


Lys 


lie 


Ser 


Thr 
815 


His 


Leu 


Val 


Ser 


Thr 
820 


Val 


Tyr 


Thr 


Gin 


Asp 
825 


Gin 


His 


Leu 


His 


Asn 
830 


Phe 


Phe 


Gin 


Tyr 


Cys 
835 


Gin 


Lys 


Thr 


Glu 


Ser 
840 


Gly 


Ala 


Gin 


Ala 


Leu 
845 


Gly 


Asn 


Glu 


Leu 


Val 
850 


Lys 


Tyr 


Leu 


Lys 


Ser 
855 




His 


Ala 


Met 


Glu 
860 


Gly 


His 


Val 


Met 


lie 
865 


Ala 


Phe 


Leu 


Pro 


Thr 
870 


lie 






Gin 


875 


Phe 


Arg 


Val 


Leu 


Thr 
880 


Arg 


Ala 


Thr 


Gin 


Glu 
885 


Glu 


Val 


Ala 


Val 


890 


Val 


Thr 


Arg 


Val 


lie 
895 


He 


His 


Val 


Val 


Ala 
900 


Gin 


Cys 


His 


Glu 


Glu 
905 


Gly 


Leu 


Glu 


Ser 


His 
910 


Leu 


Arg 


Ser 


Tyr 


Val 
915 


Lys 


Tyr 


Ala 


Tyr 


Lys 
920 


Ala 


Glu 


Pro 


Tyr 


Val 
925 


Ala 


Ser 


Glu 


Tyr 


Lys 
930 


Thr 


Val 


His 


Glu 


Glu 
935 




Thr 




Ser 


Met 
940 


Thr 


Thr 


lie 


Leu 


Lys 
945 


Pro 


Ser 


Ala 




Phe 
950 




Thr 


Ser 




955 




Leu 


Lys 


Tyr 


Ser 
960 


Trp 


Phe 


Phe 


Phe 


Asp 
965 


Val 


Leu 


lie 


Lys 


Ser 
970 


Met 


Ala 


Gin 


His 


Leu 
975 


He 


Glu 


Asn 


Ser 


Lys 
980 


Val 


Lys 


Leu 


Leu 


Arg 
985 


Asn 


Gin 


Arg 


Phe 


Pro 
990 


Ala 


Ser 


Tyr 
lie 


His 


His 
995 


Ala 


Val 


Glu 


Thr 


Val Val Asn Met Leu Met Pro His 
1000 1005 


Thr 


Gin Lys Phe Arg Asp Asn Pro Glu Ala Ser Lys Asn Ala Asn 
1010 1015 1020 


His 


Ser Leu Ala Val Phe lie Lys Arg Cys Phe Thr Phe Met Asp 
1025 1030 1035 
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1040 










1045 










1050 










1055 










1060 








Tyr 


1065 










107 0 










1075 










1080 










1085 










1090 








Arg 


1095 










1100 










1105 










1110 


Asn 


His 


Phe 




Val 
1115 


Gly 


Leu 


Leu 


Leu 


Arg 
1120 


Glu 


Val 


Gly 


Thr 


Ala 
1125 


Leu 


Gin 


Glu 


Phe 


Arg 
1130 


Glu 


Val 


Arg 




lie 
1135 


Ala 


He 


Ser 


Val 


Leu 
1140 


Lys 


Asn 


Leu 


Leu 


lie 
1145 


Lys 


His 


Ser 


Phe 


Asp 
1150 


Asp 


Arg 


Tyr 


Ala 


Ser 
1155 


Arg 


Ser 


His 


Gin 


Ala 
1160 


Arg 


He 


Ala 


Thr 


Leu 
1165 


Tyr 


Leu 


Pro 


Leu 


Phe 
1170 


Gly 


Leu 


Leu 


lie 


Glu 
1175 


Asn 


Val 


Gin 


Arg 


He 
1180 


Asn 


Val 


Arg 


Asp 


Val 
1185 


Ser 


Pro 


Phe 


Pro 


Val 
1190 


Asn 


Ala 


Gly 


Met 


Thr 
1195 


Val 


Lys 


Asp 


Glu 


Ser 
1200 


Leu 


Ala 


Leu 


Pro 


Ala 
1205 


Val 


Asn 


Pro 


Leu 


Val 
1210 


Thr 


Pro 


Gin 


Lys 


Gly 
1215 


Ser 


Thr 


Leu 


Asp 


Asn 
1220 


Ser 


Leu 


His 


Lys 


Asp 
1225 


Leu 


Leu 


Gly 


Ala 


He 
1230 


Ser 


Gly 


He 


Ala 


Ser 
1235 


Pro 


Tyr 


Thr 


Thr 


Ser 
1240 


Thr 


Pro 


Asn 


He 


Asn 
1245 


Ser 


Val 


Arg 


Asn 


Ala 
1250 


Asp 


Ser 


Arg 


Gly 


Ser 
1255 


Leu 


He 


Ser 


Thr 


Asp 
1260 


Ser 


Gly 


Asn 




Leu 
1265 


Pro 


Glu 


Arg 


Asn 


Ser 
1270 


Glu 


Lys 


Ser 


Asn 


Ser 
1275 


Leu 


Asp 


Lys 




1280 




Ser 


Ser 




1285 


Gly 




Ser 


Val 


Val 
1290 


Arg 


Cys 


Asp 




1295 










1300 










1305 








Tyr 


He 
1310 


Leu 


Lys 


Ser 


Met 


Ser 
1315 


Asp 


Asp 


Ala 




Phe 
1320 


Thr 


Tyr 


Trp 


Asn 


Lys 
1325 


Ala 


Ser 


Thr 


Ser 


Glu 
1330 


Leu 


Met 


Asp 


Phe 


Phe 
1335 


Thr 


He 


Ser 


Glu 


Val 
1340 


Cys 




His 


Gin 


Phe 
1345 


Gin 


Tyr 


Met 


Gly 


Lys 
1350 


Arg 


Tyr 


He 


Ala 


Arg 
1355 


Thr 


Gly 


Met 


Met 


His 
1360 


Ala 


Arg 


Leu 


Gin 


Gin 
1365 


Leu 


Gly 


Ser 
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1370 










1375 










1380 










1385 










1390 










1395 










1400 










1405 










1410 










1415 










1420 










1425 










143 0 










1435 






Cys 




1440 










1445 










145 0 










1455 


Leu 


Arg 


Ser 


Leu 


lie 
1460 


Tyr 


Lys 


Phe 


Pro 


Ser 
1465 


Thr 


Phe 


Tyr 


Glu 


Gly 
1470 




Ala 


Asp 


Met 


Cys 
1475 


Ala 


Ala 


Leu 


Cys 


Tyr 
1480 


Glu 


He 


Leu 


Lys 


Cys 
1485 


Cys 


Asn 


Ser 


Lys 


Leu 
1490 


Ser 


Ser 


lie 


Arg 


Thr 
1495 


Glu 


Ala 


Ser 


Gin 


1500 




Tyr 


Phe 


Leu 


Met 
1505 


Arg 


Asn 


Asn 


Phe 


Asp 
1510 


Tyr 


Thr 


Gly 




1515 


Ser 


Phe 


Val 


Arg 


Thr 
1520 


His 


Leu 


Gin 


Val 


He 
1525 


He 


Ser 


Val 


Ser 


Gin 
1530 


Leu 


He 


Ala 


Asp 


Val 
1535 


Val 


Gly 


lie 


Gly 


Gly 
1540 


Thr 


Arg 


Phe 


Gin 


Gin 
1545 


Ser 


Leu 


Ser 


lie 


lie 
1550 


Asn 


Asn 


Cys 


Ala 


Asn 
1555 


Ser 


Asp 


Arg 




lie 
1560 








Ser 


Phe 
1565 


Ser 


Ser 


Asp 


Val 


Lys 
1570 


Asp 


Leu 


Thr 


Lys 


Arg 
1575 


lie 


Arg 


Thr 




1580 










1585 






Glu 




Glu 
1590 


Asn 


Asp 


Pro 


Glu 


Met 
1595 


Leu 


Val 


Asp 


Leu 


Gin 
1600 


Tyr 


Ser 


Leu 


Ala 


Lys 
1605 


Ser 


Tyr 


Ala 


Ser 


Thr 
1610 


Pro 


Glu 


Leu 


Arg 


Lys 
1615 


Thr 


Trp 


Leu 


Asp 


Ser 
1620 


Met 


Ala 


Arg 


lie 


His 
1625 


Val 


Lys 


Asn 


Gly 


Asp 
1630 


Leu 


Ser 


Glu 


Ala 


Ala 
1635 


Met 


Cys 


Tyr 


Val 


His 
1640 


Val 


Thr 


Ala 


Leu 


Val 
1645 


Ala 


Glu 


Tyr 


Leu 


Thr 
1650 


Arg 


Lys 


Glu 


Ala 


Val 
1655 


Gin 


Trp 


Glu 


Pro 


Pro 
1660 


Leu 


Leu 


Pro 


His 


Ser 
1665 


His 


Ser 


Ala 


Cys 


Leu 
1670 


Arg 


Arg 


Ser 


Arg 


Gly 
1675 


Gly 


Val 


Phe 


Arg 


Gin 
1680 


Gly Cys 


Thr 


Ala 


Phe 
1685 


Arg 


Val 


He 


Thr 


Pro 
1690 


Asn 


He 


Asp 


Glu 


Glu 
1695 


Ala 


Ser 


Met 
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1700 










1705 










1710 




Asp Val 


Leu 


Met 
1715 


Glu 


Leu 


Leu 


Glu 


1720 


Cys 








1725 






Ala 




1730 










1735 










1740 




He 


Pro 


lie 


Tyr 
1745 


Glu 


Lys 


Arg 


Arg 


Asp 
1750 


Phe 


Glu 


Arg 


Leu 


Ala 
1755 


His 






Asp 


Thr 
1760 


Leu 


His 


Arg 


Ala 


Tyr 
1765 


Ser 


Lys 


Val 


Thr 


Glu 
1770 


Val 


Met 


His 


Ser 


Gly 
1775 


Arg 


Arg 


Leu 


Leu 


Gly 
1780 


Thr 


Tyr 


Phe 


Arg 


Val 
1785 


Ala 






Gly 


Gin 
1790 


Ala 


Ala 


Gin 


Tyr 


Gin 
1795 


Phe 


Thr 


Asp 


Ser 


Glu 
1800 


Thr 




Val 


Glu 


Gly 
1805 


Phe 


Phe 


Glu 


Asp 


Glu 
1810 


Asp 


Gly 


Lys 


Glu 


Tyr 
1815 


He 






Glu 


Pro 
1820 


Lys 


Leu 


Thr 


Pro 


Leu 
1825 


Ser 


Glu 


He 


Ser 


Gin 
1830 


Arg 






Lys 


Leu 
1835 


Tyr 


Ser 


Asp 


Lys 


Phe 
1840 


Gly 


Ser 


Glu 


Asn 


Val 
1845 


Lys 




He 


Gin 


Asp 
1S50 


Ser 


Gly 


Lys 


Val 


Asn 
1855 


Pro 


Lys 


Asp 


Leu 


Asp 
1860 


Ser 






Ala 


Tyr 
1865 


lie 


Gin 


Val 


Thr 


His 
1870 


Val 


He 


Pro 


Phe 


Phe 
1875 


Asp 






Glu 


Leu 
1880 


Gin 


Glu 


Arg 


Lys 


Thr 
1885 


Glu 


Phe 


Glu 


Arg 


Ser 
1890 


His 




He 


Arg 


Arg 
1895 


Phe 


Met 


Phe 


Glu 


Met 
1900 


Pro 


Phe 


Thr 


Gin 


Thr 
1905 


Gly 






Gin 


Gly 
1910 


Gly 


Val 


Glu 


Glu 


Gin 
1915 


Cys 


Lys 


Arg 


Arg 


Thr 
1920 


He 


Leu 


Thr 


Ala 


lie 
1925 


His 


Cys 


Phe 


Pro 


Tyr 
1930 


Val 


Lys 


Lys 


Arg 


He 
1935 


Pro 


Val 


Met 


Tyr 


Gin 
1940 


His 


His 


Thr 


Asp 


Leu 
1945 


Asn 


Pro 


He 


Glu 


Val 
1950 


Ala 


He 


Asp 


Glu 


Met 
1955 


Ser 


Lys 


Lys 


Val 


Ala 
1960 


Glu 


Leu 


Arg 


Gin 


Leu 
1965 


Cys 


Ser 


Ser 


Ala 


Glu 
1970 


Val 


Asp 


Met 


He 


Lys 
1975 


Leu 


Gin 


Leu 


Lys 


Leu 
1980 


Gin 


Gly Ser 


Val 


Ser 
1985 


Val 


Gin 


Val 


Asn 


Ala 
1990 


Gly 


Pro 


Leu 


Ala 


Tyr 
1995 


Ala 


Arg 


Ala 


Phe 


Leu 
2000 


Asp 


Asp 


Thr 


Asn 


Thr 
2005 


Lys 


Arg 


Tyr 


Pro 


Asp 
2010 


Asn 


Lys 


Val 


Lys 


Leu 
2015 


Leu 


Lys 


Glu 


Val 


Phe 
2020 


Arg 


Gin 


Phe 


Val 


Glu 
2025 


Ala 


Cys 


Gly 
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2030 










2035 










2040 










2 04 5 










2050 






Tyr 




2055 








Glu 


Leu 
2060 


Ser 


Glu 


He 


Met 


His 
2065 


Glu 


Gin 


He 


Cys 


Pro 
2070 


Leu 


Glu 


Glu 


Lys 


Thr 
2075 


Ser 


Val 


Leu 


Pro 


Asn 
2080 


Ser 


Leu 


His 


He 


Phe 
2085 


Asn 


Ala 


He 


Ser 


Gly 
2090 


Thr 


Pro 


Thr 


Ser 


Thr 
2095 


Met 


Val 


His 


Gly 


Met 
2100 


Thr 


Ser 


Ser 


Ser 


Ser 
2105 


Val 


Val 

























Table 9 hMXl nucleotide sequence (SEQ ID NO:9) 



ttttgtttac 


agggaacacg 


ggtctggctg 


agagaaaatg 


gccagcattt 


tccaagtact 


60 


gtaaattcct 


gtgcagaagg 


catcgtcgtc 


ttccggacag 


actatggtca 


ggtattcact 


120 


tacaagcaga 


gcacaattac 


ccaccagaag 


gtgactgcta 


tgcaccccac 


gaacgaggag 


180 


ggcgtggatg 


acatggcgtc 


cttgacagag 










ttccagcggt 


ataagagaaa 


tcaaatatgg 


acctacatcg 


gctccatcct 


ggcctctgtg 


300 


aacccctacc 


agcccatcgc 


cgggctgtac 


gagcctgcca 


ccatggagca 


gtacagccgg 


360 


cgccacctgg 


gcgagctgcc 


cccgcacatc 


ttcgccatcg 


ccaacgagtg 


ctaccgctgc 


420 


ctgtggaagc 


gccacgacaa 


ccagtgcatc 


ctcatcaagg 


gtgaaagtgg 


ggcaggtaaa 


480 


accgaaagca 


ctaaattgat 


cctcaagttt 


ctgtcagtca 


tcagtcaaca 


gtctttggaa 


540 


ttgtccttaa 


aggagaagac 


atcctgtgtt 


gaacgagcta 


ttcttgaaag 


cagccccatc 


600 


atggaagctt 


tcggcaatgc 


gaagaccgtg 


tacaacaaca 


actctagtcg 


ctttgggaag 


660 


tttgttcagc 


tgaacatctg 


tcagaaagga 


aatattcagg 


gcgggagaat 


tgtagattgt 


720 


atcctctctt 


cccagaaccg 


agtagtaagg 


caaaatcccg 


gggaaaggaa 


ttatcacata 


780 


ttttatgcac 


tgctggcagg 


gctggaacat 


gaagaaagag 


aagaatttta 


tttatctacg 


840 


ccagaaaact 


accactactt 


gaatcagtct 


ggatgtgtag 


aagacaagac 


aatcagtgac 


900 


caggaatcct 


ttagggaagt 


tattacggca 


atggacgtga 


tgcagttcag 


caaggaggaa 


960 


gttcgggaag 


tgtcgaggct 


gcttgctggt 


atactgcatc 


ttgggaacat 


agaatttatc 


1020 


actgctggtg 


gggcacaggt 


ttccttcaaa 


acagctttgg 


gcagatctgc 


ggagttactt 


1080 


gggctggacc 


caacacagct 


cacagatgct 


ttgacccaga 


gatcaatgtt 


cctcagggga 


1140 


gaagagatcc 


tcacgcctct 


caatgttcaa 


caggcagtag 


acagcaggga 


ctccctggcc 


1200 


atggctctgt 


atgcgtgctg 


ctttgagtgg 


gtaatcaaga 


agatcaacag 


caggatcaaa 


1260 


ggcaatgagg 


acttcaagtc 


tattggcatc 


ctcgacatct 


ttggatttga 


aaactttgag 


1320 


gttaatcact 


ttgaacagtt 


caatataaac 


tatgcaaacg 


agaaacttca 


ggagtacttc 


1380 


aacaagcata 


ttttttcttt 


agaacaacta 


gaatatagca 


gggaaggatt 


agtgtgggaa 


1440 


gatattgact 


ggatagacaa 


tggagaatgc 


ctggacttga 


ttgagaagaa 


acttggcctc 


1500 


ctagccctta 


tcaatgaaga 


aagccatttt 


cctcaagcca 


cagacagcac 


cttattggag 


1560 


aagctacaca 


gtcagcatgc 


gaataaccac 


ttttatgtga 


agcccagagt 


tgcagttaac 


1620 
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aattttggag tgaagcacta tgctggagag gtgcaatatg atgtccgagg tatcttggag 1680 

aagaacagag atacatttcg agatgacctt ctcaatttgc taagagaaag ccggtttgac 1740 

tttatctacg atctttttga acatgtttca agccgcaaca accaggatac cttgaaatgt 1800 

ggaagcaaac atcggcggcc tacagtcagc tcacagttca aggttgactc actgcattcc 1860 

ttaatggcaa cgctaagctc ctctaatcct ttctttgttc gctgtatcaa gccaaacatg 1920 

cagaagatgc cagaccagtt tgaccaggcg gttgtgctga accagctgcg gtactcaggg 1980 

atgctggaga ctgtgagaat ccgcaaagct gggtatgcgg tccgaagacc ctttcaggac 2 040 

ttttacaaaa ggtataaagt gctgatgagg aatctggctc tgcctgagga cgtccgaggg 2100 

aagtgcacga gcctgctgca gctctatgat gcctccaaca gcgagtggca gctggggaag 2160 

accaaggtat ttcttcgaga atccttggaa cagaaactgg agaagcggag ggaagaggaa 2220 

gtgagccacg cggccatggt gattcgggcc catgtcttgg gcttcttagc acggaaacaa 2280 

tacagaaagg tcctttattg tgtggtgata atacagaaga attacagagc attccttctg 2340 

aggaggagat ttttgcacct gaaaaaggca gccatagttt tccagaagca actcagaggt 2400 

cagattgctc ggagagttta cagacaattg ctggcagaga aaagggagca agaagaaaag 2460 

aagaaacagg aagaggaaga aaagaagaaa cgggaggaag aagaaagaga aagagagaga 252 0 

gagcgaagag aagccgagct ccgcgcccag caggaagaag aaacgaggaa gcagcaagaa 2580 

ctcgaagcct tgcagaagag ccagaaggaa gctgaactga cccgtgaact ggagaaacag 2640 

aaggaaaata agcaggtgga agagatcctc cgtctggaga aagaaatcga ggacctgcag 2700 

cgcatgaagg agcagcagga gctgtcgctg accgaggctt ccctgcagaa gctgcaggag 2760 

cggcgggacc aggagctccg caggctggag gaggaagcgt gcagggcggc ccaggagttc 282 0 

ctcgagtccc tcaatttcga cgagatcgac gagtgtgtcc ggaatatcga gcggtccctg 2880 

tcggggggaa gcgaattttc cagcgagctg gctgagagcg catgcgagga gaagcccaac 2940 

ttcaacttca gccagcccta cccagaggag gaggtcgatg agggcttcga agccgacgac 3000 

gacgccttca aggactcccc caaccccagc gagcacggcc actcagacca gcgaacaagt 3060 

ggcatccgga ccagcgatga ctcttcagag gaggacccat acatgaacga cacggtggtg 312 0 

cccaccagcc ccagtgcgga cagcacggtg ctgctcgccc catcagtgca ggactccggg 3180 

agcctacaca actcctccag cggcgagtcc acctactgca tgccccagaa cgctggggac 3240 

ttgccctccc cagacggcga ctacgactac gaccaggatg actatgagga cggtgccatc 3300 

acttccggca gcagcgtgac cttctccaac tcctacggca gccagtggtc ccccgactac 33 60 

cgctgctctg tggggaccta caacagctcg ggtgcctacc ggttcagctc tgagggggcg 3420 

cagtcctcgt ttgaagatag tgaagaggac tttgattcca ggtttgatac agatgatgag 3480 

ctttcatacc ggcgtgactc tgtgtacagc tgtgtcactc tgccgtattt ccacagcttt 3540 

ctgtacatga aaggtggcct gatgaactct tggaaacgcc gctggtgcgt cctcaaggat 3600 

gaaaccttct tgtggttccg ctccaagcag gaggccctca agcaaggctg gctccacaaa 3660 

aaaggggggg gctcctccac gctgtccagg agaaattgga agaagcgctg gtttgtcctc 3720 

cgccagtcca agctgatgta ctttgaaaac gacagcgagg agaagctcaa gggcaccgta 3780 

gaagtgcgaa cggcaaaaga gatcatagat aacaccacca aggagaatgg gatcgacatc 3840 

attatggccg ataggacttt ccacctgatt gcagagtccc cagaagatgc cagccagtgg 3900 

ttcagcgtgc tgagtcaggt ccacgcgtcc acggaccagg agatccagga gatgcatgat 3960 

gagcaggcaa acccacagaa tgctgtgggc accttggatg tggggctgat tgattctgtg 402 0 

tgtgcctctg acagccctga tagacccaac tcgtttgtga tcatcacggc caaccgggtg 4080 

ctgcactgca acgccgacac gccggaggag atgcaccact ggataaccct gctgcagagg 4140 

tccaaagggg acaccagagt ggagggccag gaattcatcg tgagaggatg gttgcacaaa 42 00 

gaggtgaaga acagtccaaa gatgtcttca ctgaaactga agaaacggtg gtttgtactc 4260 
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acccacaatt 


ccctggatta 


ctacaagagt 


tcagagaaga 


acgcgctcaa 


actggggacc 


4320 


ctggtcctca 


acagcctctg 


ctctgtcgtc 


cccccagatg 


agaagatatt 


caaagagaca 


4380 


ggctactgga 


acgtcaccgt 


gtacgggcgc 


aagcactgtt 


accggctcta 


caccaagctg 


4440 


ctcaacgagg 


ccacccggtg 


gtccagtgtc 


attcaaaacg 


tgactgacac 


caaggccccg 


4500 


atcgacaccc 


ccacccagca 


gctgattcaa 


gatatcaagg 


agaactgcct 


gaactcggat 


4560 


gtggtggaac 


agatttacaa 


gcggaacccg 


atccttcgat 


acacccatca 


ccccttgcac 


4620 


tccccgctcc 


tgccccttcc 


gtatggggac 


ataaatctca 


acttgctgaa 


agacaaaggc 


4680 


tataccaccc 


ttcaggatga 


ggccatcaag 


atattcaatt 


ccctgcagca 


actggagtcc 


4740 


atgtctgacc 


caattccaat 


aatccagggc 


atcctacaga 


cagggcatga 


cctgcgacct 


4800 


ctgcgggacg 


agctgtactg 


ccagcttatc 


aaacagacca 


acaaagtgcc 


ccaccccggc 


4860 


agtgtgggca 


acctgtacag 


ctggcagatc 


ctgacatgcc 


tgagctgcac 


cttcctgccg 


4920 


agtcgaggga 


ttctcaagta 


tctcaagttc 


catctgaaaa 


ggatacggga 


acagtttcca 


4980 


ggaaccgaga 


tggaaaaata 


cgctctcttc 


acttacgaat 


ctcttaagaa 


aaccaaatgc 


5040 


cgagagtttg 


tgccttcccg 


agatgaaata 


gaagctctga 


tccacaggca 


ggaaatgaca 


5100 


tccacggtct 


attgccatgg 


cggcggctcc 


tgcaagatca 


ccatcaactc 


ccacaccacc 


5160 


gctggggagg 


tggtggagaa 


gctgatccga 


ggcctggcca 


tggaggacag 


caggaacatg 


5220 


tttgctttgt 


ttgaatacaa 


cggccacgtc 


gacaaagcca 


ttgaaagtcg 


aaccgtcgta 


5280 


gctgatgtct 


tagccaagtt 


tgaaaagctg 


gctgccacat 


ccgaggttgg 


ggacctgcca 


5340 


tggaaattct 


acttcaaact 


ttactgcttc 


ctggacacag 


acaacgtgcc 


aaaagacagt 


5400 


gtggagtttg 


catttatgtt 


tgaacaggcc 


cacgaagcgg 


ttatccatgg 


ccaccatcca 


5460 


gccccggaag 


aaaacctcca 


ggttcttgct 


gccctgcgac 


tccagtatct 


gcagggggat 


5520 


tatactctgc 


acgctgccat 


cccacctctc 


gaagaggttt 


attccctgca 


gagac t caag 


5580 


gcccgcatca 


gccagtcaac 


caaaaccttc 


accccttgtg 


aacggctgga 


gaagaggcgg 


5640 


acgagcttcc 


tagaggggac 


cctgaggcgg 


agcttccgga 


caggatccgt 


ggtccggcag 


5700 


aaggtcgagg 


aggagcagat 


gctggacatg 


tggattaagg 


aagaagtctc 


ctctgctcga 


5760 


gccagtatca 


ttgacaagtg 


gaggaaattt 


cagggaatga 


accaggaaca 


ggccatggcc 


5820 


aagtacatgg 


ccttgatcaa 


ggagtggcct 


ggctatggct 


cgacgctgtt 


tgatgtggag 


5880 


tgcaaggaag 


gtggcttccc 


tcaggaactc 


tggttgggtg 


tcagcgcgga 


cgccgtctcc 


5940 


gtctacaagc 


gtggagaggg 


aagaccactg 


gaagtcttcc 


agtatgaaca 


catcctctct 


6000 


tttggggcac 


ccctggcgaa 


tacgtataag 


atcgtggtcg 


atgagaggga 


gctgctcttt 


6060 


gaaaccagtg 


aggtagtgga 


tgtggccaag 


ctcatgaaag 


cctacatcag 


catgatcgtg 


6120 


aagaagcgct 


acagcacgac 


acgctccgcc 


agcagccagg 


gcagctccag 


g 


6171 



Table 10 hMXl polypeptide sequence (SEQ ID NO: 10) 



Phe 

l 


Cys 


Leu 


Gin Gly Thr 
5 


Arg 


Val 


Trp 


Leu 
10 


Arg 


Glu 


Asn 


Gly 


Gin 
15 


His 


Phe 


Pro 


Ser 


Thr Val Asn 
20 


Ser 


Cys 


Ala 
25 


Glu 


Gly 


lie 


Val 


Val 
30 


Phe 




Thr 


Asp 


Tyr 
35 


Gly Gin Val 


Phe 


Thr 
40 


Tyr 


Lys 


Gin 


Ser 


Thr 
45 


lie 


Thr 


His 


Gin 


Lys 


Val 


Thr Ala Met 


His 


Pro 


Thr 


Asn 


Glu 


Glu 


Gly 


Val 


Asp 


Asp 
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50 










55 










60 










Met 
65 


Ala 


Ser 


Leu 


Thr 


Glu 
70 


Leu 


His 


Gly 


Gly 


Ser 
75 


He 


Met 


Tyr 


Asn 


Leu 
80 


Phe 


Gin 


Arg 


Tyr 


Lys 
85 


Arg 


Asn 


Gin 


He 


Trp 
90 


Thr 


Tyr 


He 


Gly 


Ser 
95 


He 


Leu 


Ala 


Ser 


Val 
100 


Asn 


Pro 


Tyr 


Gin 


Pro 
105 


He 


Ala 


Gly 


Leu 


Tyr 
110 


Glu 


Pro 


Ala 


Thr 


Met 
115 


Glu 


Gin 


Tyr 


Ser 


Arg 
120 


Arg 


His 


Leu 


Gly 


Glu 
125 


Leu 


Pro 


Pro 


His 


He 
130 


Phe 


Ala 


He 


Ala 


Asn 
135 


Glu 


Cys 


Tyr 


Arg 


Cys 
140 


Leu 


Trp 


Lys 


Arg 


His 
145 


Asp 


Asn 


Gin 


Cys 


He 
150 


Leu 


He 


Lys 


Gly 


Glu 
155 


Ser 


Gly 


Ala 


Gly 


Lys 
160 


Thr 


Glu 


Ser 


Thr 


Lys 
165 


Leu 


He 


Leu 


Lys 


Phe 
170 


Leu 


Ser 


Val 


He 


Ser 
175 


Gin 


Gin 


Ser 


Leu 


Glu 
180 


Leu 


Ser 


Leu 


Lys 


Glu 
185 


Lys 


Thr 


Ser 


Cys 


Val 
190 


Glu 


Arg 


Ala 


He 


Leu 
195 


Glu 


Ser 


Ser 


Pro 


He 
200 


Met 


Glu 


Ala 


Phe 


Gly 
205 


Asn 


Ala 


Lys 


Thr 


Val 
210 


Tyr 


Asn 


Asn 


Asn 


Ser 
215 


Ser 


Arg 


Phe 


Gly 


Lys 
220 


Phe 


Val 


Gin 


Leu 


Asn 
225 


He 


Cys 


Gin 


Lys 


Gly 
230 


Asn 


He 


Gin 


Gly 


Gly 
235 


Arg 


He 


Val 


Asp 


Cys 
240 


He 


Leu 


Ser 


Ser 


Gin 
245 


Asn 


Arg 


Val 


Val 


Arg 
250 


Gin 


Asn 


Pro 


Gly 


Glu 
255 


Arg 


Asn 


Tyr 


His 


He 
260 


Phe 


Tyr 


Ala 


Leu 


Leu 
265 


Ala 


Gly 


L ™ 


Glu 


His 
270 


Glu 


Glu 


Arg 


Glu 


Glu 
275 


Phe 


Tyr 


Leu 


Ser 


Thr 
280 


Pro 


Glu 


Asn 


Tyr 


His 
285 


Tyr 


Leu 


Asn 


Gin 


Ser 
290 


Gly 


Cys 


Val 


Glu 


Asp 
295 


Lys 


Thr 


He 


Ser 


Asp 
300 


Gin 


Glu 


Ser 


Phe 


Arg 
305 


Glu 


Val 


He 


Thr 


Ala 
310 


Met 


Asp 


Val 


Met 


Gin 
315 


Phe 


Ser 


Lys 


Glu 


Glu 
320 


Val 


Arg 


Glu 


Val 


Ser 
325 


Arg 


Leu 


Leu 


Ala 


Gly 
330 


He 


Leu 


His 


Leu 


Gly 
335 


Asn 


He 


Glu 


Phe 


He 
340 


Thr 


Ala 


Gly 


Gly 


Ala 
345 


Gin 


Val 


Ser 


Phe 


Lys 
350 


Thr 


Ala 


Leu 


Gly 


Arg 
355 


Ser 


Ala 


Glu 


Leu 


Leu 
360 


Gly 


Leu 


Asp 


Pro 


Thr 
365 


Gin 


Leu 


Thr 


Asp 


Ala 
370 


Leu 


Thr 


Gin 


Arg 


Ser 
375 


Met 


Phe 


Leu 


Arg 


Gly 
380 


Glu 


Glu 


He 


Leu 


Thr 
385 


Pro 


Leu 


Asn 


Val 


Gin 
390 


Gin 


Ala 


Val 


Asp 


Ser 
395 


Arg 


Asp 


Ser 




Ala 
400 


Met 


Ala 




Tyr 


Ala 


Cys 


Cys 


Phe 


Glu 


Trp 


Val 


He 


Lys 


Lys 


He 


Asn 
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405 










410 










415 










420 










425 










430 










435 










440 










445 










450 


Tyr 








455 








Tyr 


460 










465 


Ser 








Leu 
470 


Glu 








Glu 
475 


Gly 


Leu 


Val 


Trp 


Glu 
480 


Asp 


lie 


Asp 


Trp 


lie 

485 


Asp 


Asn 


Gly 


Glu 


Cys 
490 


Leu 


Asp 


Leu 


lie 


Glu 
495 


Lys 


Lys 


Leu 


Gly 


Leu 
500 


Leu 


Ala 


Leu 


lie 


Asn 
505 


Glu 


Glu 


Ser 


His 


Phe 
510 


Pro 


Gin 


Ala 


Thr 


Asp 
515 


Ser 


Thr 


Leu 


Leu 


Glu 
520 


Lys 


Leu 


His 


Ser 


Gin 
525 


His 


Ala 


Asn 


Asn 


His 
530 


Phe 


Tyr 


Val 


Lys 


Pro 
535 


Arg 


Val 


Ala 


Val 


Asn 
540 


Asn 


Phe 


Gly 


Val 


Lys 
545 


His 


Tyr 


Ala 


Gly 


Glu 
550 


Val 


Gin 


Tyr 


Asp 


Val 
555 


Arg 


Gly 


lie 


Leu 


Glu 
560 


Lys 


Asn 


Arg 


Asp 


Thr 
565 


Phe 




Asp 


Asp 


570 




Asn 






575 


Glu 


Ser 


Arg 


Phe 


Asp 
580 


Phe 


lie 


Tyr 




585 


Phe 


Glu 


His 


Val 


Ser 
590 


Ser 


Arg 


Asn 


Asn 


Gin 
595 


Asp 


Thr 


Leu 


Lys 


Cys 
600 


Gly 


Ser 




His 


Arg 
605 


Arg 


Pro 


Thr 


Val 


Ser 
610 


Ser 


Gin 


Phe 




Val 
615 


Asp 


Ser 




His 


Ser 
620 


Leu 


Met 


Ala 


Thr 


625 


Ser 




Ser 




630 




Phe 


Val 


Arg 


Cys 
635 


lie 


Lys 


Pro 


Asn 


Met 
640 




Lys 




Pro 


Asp 
645 


Gin 


Phe 


Asp 


Gin 


Ala 
650 


Val 


Val 


Leu 


Asn 


Gin 
655 


Leu 




Tyr 


Ser 


660 




Leu 


Glu 


Thr 


Val 
665 


Arg 


lie 


Arg 


Lys 


Ala 
670 


Gly 


Tyr 






Arg 
675 










680 








Arg 


Tyr 
685 


Lys 


Val 


Leu 


Met 


690 


Asn 


Leu 


Ala 


Leu 


695 


Glu 


Asp 


Val 


Arg 


Gly 
700 


Lys 


Cys 


Thr 


Ser 


Leu 
705 


Leu 


Gin 


Leu 


Tyr 


Asp 
710 


Ala 


Ser 


Asn 


Ser 


Glu 
715 


Trp 


Gin 


Leu 


Gly 


Lys 
720 


Thr 




Val 


Phe 


Leu 
725 


Arg 


Glu 


Ser 


Leu 


Glu 
730 


Gin 


Lys 


Leu 


Glu 


Lys 
735 


Arg 


Arg 


Glu 


Glu 


Glu 
740 


Val 


Ser 


His 


Ala 


Ala 
745 


Met 


Val 


lie 


Arg 


Ala 
750 


His 


Val 




Gly 


Phe 




Ala 


Arg 


Lys 


Gin 


Tyr 


Arg 


Lys 


Val 


Leu 


Tyr 


Cys 


Val 



-31 - 



Val lie lie Gin Lys Asn Tyr Arg Ala Phe Leu Leu Arg Arg Arg Phe 
770 775 780 

Leu His Leu Lys Lys Ala Ala lie Val Phe Gin Lys Gin Leu Arg Gly 
785 790 795 800 

Gin lie Ala Arg Arg Val Tyr Arg Gin Leu Leu Ala Glu Lys Arg Glu 
805 810 815 

Gin Glu Glu Lys Lys Lys Gin Glu Glu Glu Glu Lys Lys Lys Arg Glu 
820 825 830 

Glu Glu Glu Arg Glu Arg Glu Arg Glu Arg Arg Glu Ala Glu Leu Arg 
835 840 845 

Ala Gin Gin Glu Glu Glu Thr Arg Lys Gin Gin Glu Leu Glu Ala Leu 
850 855 860 

Gin Lys Ser Gin Lys Glu Ala Glu Leu Thr Arg Glu Leu Glu Lys Gin 
865 870 875 880 

Lys Glu Asn Lys Gin Val Glu Glu lie Leu Arg Leu Glu Lys Glu lie 
885 890 895 

Glu Asp Leu Gin Arg Met Lys Glu Gin Gin Glu Leu Ser Leu Thr Glu 
900 905 910 

Ala Ser Leu Gin Lys Leu Gin Glu Arg Arg Asp Gin Glu Leu Arg Arg 
915 920 925 

Leu Glu Glu Glu Ala Cys Arg Ala Ala Gin Glu Phe Leu Glu Ser Leu 
930 935 940 

Asn Phe Asp Glu lie Asp Glu Cys Val Arg Asn lie Glu Arg Ser Leu 
945 950 955 960 

Ser Gly Gly Ser Glu Phe Ser Ser Glu Leu Ala Glu Ser Ala Cys Glu 
965 970 975 

Glu Lys Pro Asn Phe Asn Phe Ser Gin Pro Tyr Pro Glu Glu Glu Val 
980 985 990 

Asp Glu Gly Phe Glu Ala Asp Asp Asp Ala Phe Lys Asp Ser Pro Asn 
995 1000 1005 

Pro Ser Glu His Gly His Ser Asp Gin Arg Thr Ser Gly lie Arg 
1010 1015 1020 

Thr Ser Asp Asp Ser Ser Glu Glu Asp Pro Tyr Met Asn Asp Thr 
1025 1030 1035 

Val Val Pro Thr Ser Pro Ser Ala Asp Ser Thr Val Leu Leu Ala 
1040 1045 1050 



Pro Asp Gly Asp Tyr Asp Tyr Asp Gin Asp Asp Tyr Glu Asp Gly 
1085 1090 1095 

Ala lie Thr Ser Gly Ser Ser Val Thr Phe Ser Asn Ser Tyr Gly 
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1100 










1105 










1110 










1115 










Tyr 
1120 










1125 




Tyr 






1130 






Tyr 




1135 










1140 










1145 










1150 










1155 










1160 






Tyr 




1165 








Tyr 


1170 










1175 










1180 










1185 










1190 










1195 


Cys 


Val 




Lys 


Asp 
1200 


Glu 


Thr 


Phe 




1205 










1210 


Glu 


Ala 




Lys 


Gin 
1215 


Gly 


Trp 


Leu 




12 2 0 










1225 










1230 








Lys 


1235 




Trp 






1240 


Arg 


Gin 


Ser 


Lys 


Leu 
1245 


Met 


Tyr 


Phe 




1250 










12 55 






Gly 




12 60 


Glu 


Val 


Arg 




Ala 
1265 


Lys 


Glu 


lie 


lie 


Asp 
1270 


Asn 


Thr 


Thr 




Glu 
1275 


Asn 


Gly 


lie 




1280 










1285 










He 
1290 


Ala 


Glu 


Ser 




1295 










1300 










1305 




Val 






1310 










1315 










1320 










1325 










1330 










Gly 
1335 


Leu 


He 


Asp 




1340 










1345 










1350 










1355 










1360 










1365 








Glu 


13 7 0 






His 


Trp 


1375 


Thr 


Leu 


Leu 


Gin 


1380 


Ser 


Lys 


Gly 


Asp 


Thr 
1385 


Arg 


Val 


Glu 


Gly 


Gin 
1390 


Glu 


Phe 


He 


Val 


Arg 
1395 


Gly 


Trp 


Leu 


His 


Lys 
1400 


Glu 


Val 


Lys 


Asn 


Ser 
1405 


Pro 




Met 


Ser 


Ser 
1410 


Leu 


Lys 


Leu 


Lys 


Lys 
1415 


Arg 


Trp 


Phe 


Val 


Leu 
1420 


Thr 


His 


Asn 


Ser 


Leu 
1425 


Asp 


Tyr 


Tyr 


Lys 


Ser 


Ser 


Glu 


Lys 


Asn 


Ala 


Leu 


Lys 




Gly 


Thr 


Leu 


Val 


Leu 
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1430 










1435 










1440 








Asn 


Ser 
1445 


Leu 


Cys 


Ser 


Val 


Val 
1450 


Pro 


Pro 


Asp 


Glu 


Lys 
1455 


He 


Phe 


Lys 


Glu 


Thr 
1460 


Gly 


Tyr 


Trp 


Asn 


Val 
1465 


Thr 


Val 


Tyr 


Gly 


Arg 
1470 




His 


Cys 


Tyr 


Arg 
1475 


Leu 


Tyr 


Thr 


Lys 


Leu 
1480 


Leu 


Asn 


Glu 


Ala 


Thr 
1485 


Arg 


Trp 


Ser 


Ser 


Val 
1490 


He 


Gin 


Asn 


Val 


Thr 
1495 


Asp 


Thr 


Lys 


Ala 


Pro 
1500 


He 


Asp 


Thr 


Pro 


Thr 
1505 


Gin 


Gin 


Leu 


He 


Gin 
1510 


Asp 


He 


Lys 


Glu 


Asn 
1515 


Cys 


Leu 


Asn 


Ser 


Asp 
1520 


Val 


Val 


Glu 


Gin 


He 
1525 


Tyr 


Lys 


Arg 


Asn 


Pro 
1530 


He 


Leu 


Arg 


Tyr 


Thr 
1535 


His 


His 


Pro 


Leu 


His 
1540 


Ser 


Pro 


Leu 


Leu 


Pro 
1545 


Leu 


Pro 


Tyr 


Gly 


Asp 
1550 


He 


Asn 


Leu 


Asn 


Leu 
1555 


Leu 


Lys 


Asp 


Lys 


Gly 
1560 


Tyr 


Thr 


Thr 


Leu 


Gin 
1565 


Asp 


Glu 


Ala 


He 


Lys 
1570 


He 


Phe 


Asn 


Ser 


Leu 
1575 


Gin 


Gin 


Leu 


Glu 


Ser 
1580 


Met 


Ser 


Asp 


Pro 


He 
1585 


Pro 


He 


He 


Gin 


Gly 
1590 


He 


Leu 


Gin 


Thr 


Gly 
1595 


His 


Asp 


Leu 


Arg 


Pro 
1600 


Leu 


Arg 


Asp 


Glu 


Leu 
1605 


Tyr 


Cys 


Gin 


Leu 


He 
1610 


Lys 


Gin 


Thr 


Asn 


Lys 
1615 


Val 


Pro 


His 


Pro 


Gly 
1620 


Ser 


Val 


Gly 


Asn 


Leu 
1625 


Tyr 


Ser 


Trp 


Gin 


He 
1630 


Leu 


Thr 


Cys 


Leu 


Ser 
1635 


Cys 


Thr 


Phe 


Leu 


Pro 
1640 


Ser 


Arg 


Gly 


He 


Leu 
1645 


Lys 


Tyr 


Leu 


Lys 


Phe 
1650 


His 




Lys 


Arg 


He 
1655 


Arg 


Glu 


Gin 


Phe 


Pro 
1660 


Gly 


Thr 


Glu 


Met 


Glu 
1665 


Lys 


Tyr 


Ala 


Leu 


Phe 
1670 


Thr 


Tyr 


Glu 


Ser 


Leu 
1675 


Lys 


Lys 


Thr 


Lys 


Cys 
1680 


Arg 


Glu 


Phe 


Val 


Pro 
1685 


Ser 


Arg 


Asp 


Glu 


He 
1690 


Glu 


Ala 


Leu 


He 


His 
1695 


Arg 


Gin 


Glu 


Met 


Thr 
1700 


Ser 


Thr 


Val 


Tyr 


Cys 
1705 


His 


Gly 


Gly 


Gly 


Ser 
1710 


Cys 


Lys 


He 


Thr 


He 
1715 


Asn 


Ser 


His 


Thr 


Thr 
1720 


Ala 


Gly 


Glu 


Val 


Val 
1725 


Glu 


Lys 


Leu 


He 


Arg 
1730 


Gly 


Leu 


Ala 


Met 


Glu 
1735 


Asp 


Ser 


Arg 


Asn 


Met 
1740 


Phe 


Ala 


Leu 


Phe 


Glu 
1745 


Tyr 


Asn 


Gly 


His 


Val 
1750 


Asp 


Lys 


Ala 


He 


Glu 
1755 


Ser 


Arg 


Thr 


Val 


Val 


Ala 


Asp 


Val 


Leu 


Ala 


Lys 


Phe 


Glu 




Leu 


Ala 


Ala 


Thr 
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1760 










1765 










1770 










1775 




Gly Asp 




1780 










1785 










1790 




Asp 


Thr 




1795 










1800 








Ala 


Phe 
1805 


Met 


Phe 


Glu 


Gin 


Ala 
1810 


His 


Glu 


Ala 


Val 


He 
1815 


His 


Gly 


His 


His 


Pro 
1820 


Ala 


Pro 


Glu 


Glu 


Asn 
1825 


Leu 


Gin 


Val 


Leu 


Ala 
1830 


Ala 


Leu 


Arg 




Gin 
1835 


Tyr 


Leu 


Gin 


Gly 


Asp 
1840 


Tyr 


Thr 


Leu 


His 


Ala 
1845 


Ala 


He 


Pro 


Pro 


Leu. 
1850 


Glu 


Glu 


Val 


Tyr 


Ser 
1855 


Leu 


Gin 


Arg 


Leu 


Lys 
1860 


Ala 


Arg 


He 


Ser 


Gin 
1865 


Ser 


Thr 


Lys 


Thr 


Phe 
1870 


Thr 


Pro 


Cys 


Glu 


Arg 
1875 


Leu 


Glu 


Lys 


Arg 


Arg 
1880 


Thr 


Ser 


Phe 


Leu 


Glu 
1885 


Gly 


Thr 


Leu 


Arg 


Arg 
1890 


Ser 


Phe 


Arg 


Thr 


Gly 
1895 


Ser 


Val 


Val 


Arg 


Gin 
1900 


Lys 


Val 


Glu 


Glu 


Glu 
1905 


Gin 


Met 


Leu 


Asp 


Met 
1910 


Trp 


He 


Lys 


Glu 


Glu 
1915 


Val 


Ser 


Ser 


Ala 


Arg 
1920 


Ala 


Ser 


He 


lie 


Asp 
1925 


Lys 


Trp 


Arg 


Lys 


Phe 
1930 


Gin 


Gly 


Met 


Asn 


Gin 
1935 


Glu 


Gin 


Ala 


Met 


Ala 
1940 


Lys 


Tyr 


Met 


Ala 


Leu 
1945 


He 


Lys 


Glu 


Trp 


Pro 
1950 


Gly 


Tyr 


Gly 


Ser 


Thr 
1955 


Leu 


Phe 


Asp 


Val 


Glu 
1960 


Cys 


Lys 


Glu 


Gly 


Gly 
1965 


Phe 


Pro 


Gin 


Glu 


Leu 
1970 


Trp 


Leu 


Gly 


Val 


Ser 
1975 


Ala 


Asp 


Ala 


Val 


Ser 
1980 


Val 


Tyr 


L " 


Arg 


Gly 
1985 


Glu 


Gly Arg 


Pro 


Leu 
1990 


Glu 


Val 


Phe 


Gin 


Tyr 
1995 


Glu 


His 


He 


Leu 


Ser 


Phe 


Gly 


Ala 


Pro 


Leu 


Ala 


Asn 


Thr 


Tyr 


Lys 


He 


Val 


Val 


Asp 


Glu 
2015 


Arg 


Glu 


Leu 


Leu 


Phe 
2020 


Glu 


Thr 


Ser 


Glu 


Val 
2025 


Val 


Asp 


Val 


Ala 


Lys 
2030 


Leu 


Met 


Lys 


Ala 


Tyr 
2035 


He 


Ser 


Met 


He 


Val 
2040 


Lys 


Lys 


Arg 


Tyr 


Ser 
2045 


Thr 


Thr 


Arg 


Ser 


Ala 
2050 


Ser 


Ser 


Gin 


Gly 


Ser 
2055 


Ser 


Arg 





hMX2 nucleotide sequence (SEQ ID NO:l 1) 



Table 11 



agctagtatc ttttattgtc agaacttctg tgagccaaca aacagttttg catggttgta 



60 
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cacaaaggga caaggcaaat ttcttttttc gtgtgggtag acttagttgg cccaagtcct 120 

taaaactttt ccatataaaa ataaaaagtc caagaccaga ttatttttct tctggtcata 180 

aatgctgatt tatttacagg tgccttgttc agaccaccat tataaacttg ggataaaata 240 

tgtgtgtatt aaagcctcag catttaatgt cagggtcctt tgaagattca ctcaagtgtt 300 

aagacgtttc tggaatgcag cgtctctccc ccatagtcaa catggttatt atatctgtaa 360 

tctatccaga atgatagaag ctaaccttcc aagtaacact ttgtttttaa cttaaatctt 42 0 

ttagacatga aagactccaa aatgacttca ttcttgttct aaaaccagca ctggagccag 480 

ctgttgaaga gtggtttata aatacagtta tcttgtaggc tgcttatctg tttataatac 540 

agcagacaca gatggcagac tttgctacat gtaaaacaat ggagtcaaca cgtgtttttc 600 

aaaatacagc aaagacagga aaatccagga tttgggtttg ttaataaaac caccttataa 660 

agtaacaatt gagactatag ctctgcatta ttaaaatata cagactgtgt acaccattac 72 0 

acatcctttt tccctttgct ttttaatgct catgaaacca tgattaaggt gttgagttta 780 

tgaacacatg cacgaacagg caagcacgta cacttaaaag atgaaacaaa gaaaaaagtt 84 0 

gattcatgtc attccatgag aaaggctgcc cgcagcactc cagctcaaac acactgtccc 900 

ctcgagctct ccatccccct tcccactccc tcaccttccc tcagattcgg ggaaatcagg 960 

ttgggaggtt agtgcatcat tgacagagaa tgcccccctt ccacgctctg ttaagtctcc 102 0 

cccagaaggg ggaaaggcag ttcccttcag tagcacagtt acggtcgatt agtgttggtt 1080 

ccacaagtta aggcacttcc ggctgctttg gtggcagcgt ggttcctccc ctcctttttt 1140 

aaggcatgtg tcctctaaga gtagtaaagc tttggaaact gtgcagactg ttaaagttga 12 0 0 

cagcttaata caggatcaat gaaggcggca ggcaaaagga tcctcggaga cacctccctc 12 60 

agaccagaag cttccagaaa gcctgggcag ctctgtgttt gttttggctg ggcatggcac 132 0 

actggagcca gcctaggcca gagggtggtg cgttcaggta gcaaagacag gtgggctctg 1380 

tcccgccttc acctggagct gccctggctg ctggcggagc gtgtcgtgct gtagcgcttc 1440 

ttcacgatca tgctgatgta ggctttcatg agcttggcca catccaccac ctcactggtt 1500 

tcaaagagca gctccctctc atcgaccacg atcttatacg tattcgccag gggtgcccca 1560 

aaagagagga tgtgttcata ctggaagact tccagtggtc ttccctctcc acgcttgtag 162 0 

acggagacgg cgtccgcgct gacacccaac cagagttcct gagggaagcc accttccttg 1680 

cactggtggg caagagtcaa cagaagagtt aagtcatgaa gtggttggca acagaaagca 1740 

tctaaaccat aagacaggct ttgagtgaag tcctctgtgc agaagattaa atatattcga 1800 

tgtgcatgca tgcatggagg ggcctgaaat atgaaaaatg gcacctctct ggctatcttg 1860 

atttctaact agttaatctc acgcttttgg gaaaacctca ctaactggca gagtctaaca 1920 

tcttgctttg actctccact tctcagcatt attctactag ctgtttggat tagctacgtg 1980 

gaagtggcct ggaaacgtac atgcttggcc gggggactta agaaagcttc cctgcaaccc 2040 

aagccaagtc tactcttgta ttaatatctc cagttctgcc tccaatcctc tttgcggatg 2100 

gttagtcttc aaatacaaaa tctaggatca cagaggaaaa ttctccaaat cacgcatgct 2160 

gagcagttct ggctcctctt cacaaggagc agcaatggcc ttccatatgc agagtgggaa 222 0 

cagggacttt accagtttaa ctgtagactt tcctgtacag attggtggaa gaaaataaga 2280 

ccccatatga aggggctaac aacacagggc tgatccaaac ctggacaagc aggagggcta 234 0 

taaattggag acgctgaaaa gagtctctag tttatatccc taataaccag acattctctg 2400 

catcctccat gcaaaagcca gtagctttct ttttttcttt ttttttttga cggagtctca 2460 

ctctgtcgcc caggctggag tgcagtggcg tgatcctggc tcactgcaac ctccacctcc 252 0 

tgagttcaag cgattctcct gtctcagcct ctcgagtagc tgggattaca ggtgcatgcc 2580 

accacgccca gctaattttt tgcagagatg gggtttcacc gtgttagcca ggatggtctc 2640 

gatctcctga cctcatgatc cgcccgcctt ggcctcccaa agcgctggga ttacaggcat 2700 
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gagccaccgc gcccggccaa gccagtagct ttctatgcta attcacagct cacgttttgc 27 60 

aggaagccaa gagtttaact gctattatct attccttgtc agggagaaat ggaattatgg 2820 

ctttgtacaa agcacctgat ttttttatac ttaaaaacag gcataattga accaaccaaa 2880 

ccaatcaaaa acatcaccta atgaaaagcc acccacggat tctagaattt ataatattta 2940 

gaattttata cagcctcaat ataaagtcat cagatatacg ctgaattact gtgatcataa 3000 

aaaatggaag ctaatctaga cgatgagctg gcacacttat ctgttaaggg ctgcatagta 3060 

aagattttta gactttgtgg atcacatggt ctctgtcaca actactcaac tctgtacaaa 3120 

aacagccagg gaatatatct aaaggaatga gcttggctgt gttccaataa aactttgttt 3180 

agaaaaaaag gaggcaggca agatctgacc cacagaccag tttgccaaac tctcatctag 3240 

acaattagta agatttcttt tcaataagcg gtctacttaa aacaaaacaa aaatcagtac 3300 

tgggttgatg ccaatggcta aattccatta cgagatagac attcttcctt tcaaacagat 3360 

ggctgtaaag aaaaaacaaa gtaaaatgca agtatatcca aagtttctaa tttgtatata 3420 

cagctataac atttttttaa atgtagattt ttatcagtgt ttaaaaaatt agatctatag 3480 

cttccctaag gaagggtaga agaatagatg acatcttaat tttgcattca ttcctaatat 3540 

tacagatgca tttactacac aggagaagag aaactgtgag gagaagggag gcgttaatgg 3600 

tacaattttg ggggctcgaa aaaaagaggt tgagagagca aaatgctcca tcttgtcttc 3660 

tctccacatg aacttggccg tgatccatgt tctcagatgc cagcacccag cccaccccaa 3720 

cacatggcag ccagttctca cctccacatc aaacagcgtc gagccatagc caggccactc 3780 

cttgatcaag gccatgtact tggccatggc ctgttcctgg ttcattccct gaaatttcct 3840 

ccacttgtca atgatactgg ctcgagcaga ggagacttct tccttaatcc acatgtccag 3900 

catctgctcc tcctcgacct tctgccggac cacggatcct gtccggaagc tccgcctcag 3960 

ggtcccctct aggaagctcg tccgcctctt ctccagccgt tcacaagggg tgaaggtttt 4020 

ggttgactgg ctgatgcggg ccttgagtct ctgcagggaa taaacctctt cgagaggtgg 4080 

gatggcagcg tgcagagtat aatccccctg cagatactgg agtcgcaggg cagcaagaac 4140 

ctggaggttt tcttccgggg ctggatggtg gccatggata accgcttcgt gggcctgttc 4200 

aaacataaat gcaaactcca cactgtcttt tggcacgttg tctgtgtcca ggaagcagta 4260 

aagtttgaag tagaatttcc atggcaggtc cccaacctcg gatgtggcag ccagcttttc 4320 

aaacttggct aagacatcag ctacgacggt tcgactttca atggctttgt cgacgtggcc 4380 

gttgtattca aacaaagcaa acatgttcct gctgtcctcc atggccaggc ctcggatcag 4440 

cttctccacc acctccccag cggtggtgtg ggagttgatg gtgatcttgc aggagccgcc 4500 

gccatggcaa tagaccgtgg atgtcatttc ctgcctgtgg atcagagctt ctatttcatc 4560 

tcgggaaggc acaaactctc ggcatttggt tttcttaaga gattcgtaag tgaagagagc 4620 

gtatttttcc atctcggttc ctggaaactg ttcccgtatc cttttcagat ggaacttgag 4680 

atacttgaga atccctcgac tcggcaggaa ggtgcagctc aggcatgtca ggatctgcca 4740 

gctgtacagg ttgcccacac tgccggggtg gggcactttg ttggtctgtt tgataagctg 4800 

gcagtacagc tcgtcccgca gaggtcgcag gtcatgccct gtctgtagga tgccctggat 4860 

tattggaatt gggtcagaca tggactccag ttgctgcagg gaattgaata tcttgatggc 4920 

ctcatcctga agggtggtat agcctttgtc tttcagcaag ttgagattta tgtccccata 4980 

cggaaggggc aggagcgggg agtgcaaggg gtgatgggtg tatcgaagga tcgggttccg 5040 

cttgtaaatc tgttccacca catccgagtt caggcagttc tccttgatat cttgaatcag 5100 

ctgctgggtg ggggtgtcga tcggggcctt ggtgtcagtc acgttttgaa tgacactgga 5160 

ccaccgggtg gcctcgttga gcagcttggt gtagagccgg taacagtgct tgcgcccgta 5220 

cacggtgacg ttccagtagc ctgtctcttt gaatatcttc tcatctgggg ggacgacaga 5280 

gcagaggctg ttgaggacca gggtccccag tttgagcgcg ttcttctctg aactcttgta 5340 
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gtaatccagg gaattgtggg tgagtacaaa ccaccgtttc ttcagtttca gtgaagacat 5400 

ctttggactg ttcttcacct ctttgtgcaa ccatcctctc acgatgaatt cctggccctc 5460 

cactctggtg tcccctttgg acctctgcag cagggttatc cagtggtgca tctcctccgg 5520 

cgtgtcggcg ttgcagtgca gcacccggtt ggccgtgatg atcacaaacg agttgggtct 5580 

atcagggctg tcagaggcac acacagaatc aatcagcccc acatccaagg tgcccacagc 5640 

attctgtggg tttgcctgct catcatgcat ctcctggatc tcctggtccg tggacgcgtg 5700 

gacctgactc agcacgctga accactggct ggcatcttct ggggactctg caatcaggtg 5760 

gaaagtccta tcggccataa tgatgtcgat cccattctcc ttggtggtgt tatctatgat 5820 

ctcttttgcc gttcgcactt ctacggtgcc cttgagcttc tcctcgctgt cgttttcaaa 5880 

gtacatcagc ttggactggc ggaggacaaa ccagcgcttc ttccaatttc tcctggacag 5940 

cgtggaggag cccccccctt ttttgtggag ccagccttgc ttgagggcct cctgcttgga 6000 

gcggaaccac aagaaggttt catccttgag gacgcaccag cggcgtttcc aagagttcat 6060 

caggccacct ttcatgtaca gaaagctgtg gaaatacggc agagtgacac agctgtacac 612 0 

agagtcacgc cggtatgaaa gctcatcatc tgtatcaaac ctggaatcaa agtcctcttc 6180 

actatcttca aacgaggact gcgccccctc agagctgaac cggtaggcac ccgagctgtt 6240 

gtaggtcccc acagagcagc ggtagtcggg ggaccactgg ctgccgtagg agttggagaa 63 00 

ggtcacgctg ctgccggaag tgatggcacc gtcctcatag tcatcctggt cgtagtcgta 6360 

gtcgccgtct ggggagggca agtccccagc gttctggggc atgcagtagg tggactcgcc 6420 

gctggaggag ttgtgtaggc tcccggagtc ctgcactgat ggggcgagca gcaccgtgct 6480 

gtccgcactg gggctggtgg gcaccaccgt gtcgttcatg tatgggtcct cctctgaaga 6540 

gtcatcgctg gtccggatgc cacttgttcg ctggtctgag tggccgtgct cgctggggtt 6600 

gggggagtcc ttgaaggcgt cgtcgtcggc ttcgaagccc tcatcgacct cctcctctgg 6660 

gtagggctgg ctgaagttga agttgggctt ctcctcgcat gcgctctcag ccagctcgct 6720 

ggaaaattcg cttccccccg acagggaccg ctcgatattc cggacacact cgtcgatctc 6780 

gtcgaaattg agggactcga ggaactcctg ggccgccctg cacgcttcct cctccagcct 6840 

gcggagctcc tggtcccgcc gctcctgcag cttctgcagg gaagcctcgg tcagcgacag 6900 

ctcctgctgc tccttcatgc gctgcaggtc ctcgatttct ttctccagac ggaggatctc 6960 

ttccacctgc ttattttcct tctgtttctc cagttcacgg gtcagttcag cttccttctg 7020 

gctcttctgc aaggcttcga gttcttgctg cttcctcgtt tcttcttcct gctgggcgcg 7080 

gagctcggct tctcttcgct ctctctctct ttctctttct tcttcctccc gtttcttctt 7140 

ttcttcctct tcctgtttct tcttttcttc ttgctccctt ttctctgcca gcaattgtct 7200 

gtaaactctc cgagcaatct gacctctgag ttgcttctgg aaaactatgg ctgccttttt 7260 

caggtgcaaa aatctcctcc tcagaaggaa tgctctgtaa ttcttctgta ttatcaccac 7320 

acaataaagg acctttctgt attgtttccg tgctaagaag cccaagacat gggcccgaat 7380 

caccatggcc gcgtggctca cttcctcttc cctccgcttc tccagtttct gttccaagga 7440 

ttctcgaaga aataccttgg tcttccccag ctgccactcg ctgttggagg catcatagag 7 500 

ctgcagcagg ctcgtgcact tccctcggac gtcctcaggc agagccagat tcctcatcag 7 560 

cactttatac cttttgtaaa agtcctgaaa gggtcttcgg accgcatacc cagctttgcg 7620 

gattctcaca gtctccagca tccctgagta ccgcagctgg ttcagcacaa ccgcctggtc 7680 

aaactggtct ggcatcttct gcatgtttgg cttgatacag cgaacaaaga aaggattaga 7740 

ggagcttagc gttgccatta aggaatgcag tgagtcaacc ttgaactgtg agctgactgt 7800 

aggccgccga tgtttgcttc cacatttcaa ggtatcctgg ttgttgcggc ttgaaacatg 7860 

ttcaaaaaga tcgtagataa agtcaaaccg gctttctctt agcaaattga gaaggtcatc 7920 

tcgaaatgta tctctgttct tctccaagat acctcggaca tcatattgca cctctccagc 7980 
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atagtgcttc 


actccaaaat 


tgttaactgc 


aactctgggc 




agtggttatt 


8040 


cgcatgctga 


ctgtgtagct 


tctccaataa 


ggtgctgtct 


gtggcttgag 


gaaaatggct 


8100 


ttcttcattg 


ataagggcta 


ggaggccaag 


tttcttctca 


atcaagtcca 


ggcattctcc 


8160 


attgtctatc 


cagtcaatat 


cttcccacac 


taatccttcc 


ctgctatatt 


ctagttgttc 


8220 


taaagaaaaa 


atatgcttgt 


tgaagtactc 


ctgaagtttc 


tcgtttgcat 


agtttatatt 


8280 


gaactgttca 


aagtgattaa 


cctcaaagt t 


ttcaaatcca 


aagatgtcga 


ggatgccaat 


8340 


agacttgaag 


tcctcattgc 


ctttgatcct 


gctgttgatc 


ttcttgatta 


cccactcaaa 


8400 


gcagcacgca 


tacagagcca 


tggccaggga 


gtccctgctg 


tctactgcct 




8460 












aagcltctgt 




gagctgtgtt 


gggtccagcc 


caagtaactc 


cgcagatctg 




ttttTaa & a 


8580 


aacctgtgcc 


ccaccagcag 


tgataaattc 




aga gcag a 


tacca^caeT 
accagcaag 


8640 


cagcctcgac 


acttcccgaa 


cttcctcctt 


gctgaactgc 


a cacg cca 


gccg aa 


8700 








tgt cttcftct 






8760 


caagtagtgg 


tagttttctg 


gcgtagatla 




tctctttctt 


cat^tcca 


8820 


ccctgccagc 


a£ ^ g ^ aaa 


atatgtgata 


attcctttcc 


ccg^atttt 


gccttactac 


8880 


t c cfcf 1 1 c t cr g - 


gaagagagga 


tacaatctac 


aattctccc C 
aa c cccg 


ccctgaatat 


ttcctttctg 


8940 


acagatgttc 


agctgaacaa 




gcgactagag 




acacggtctt 


9000 




aaagcttcca 




gctttcaaga 


atagctcgtt 


caacacagga 


9060 


Ttcttctcc 




attccaalga 








9120 


gatcaattta 


gtgctttcgg 


ttttacctgc 


cccactttca 


cccttgatga 


ggatgcactg 


9180 


gttgtcgtgg 


cgcttccaca 


ggcagcggta 


gcactcgttg 


gcgatggcga 


agatgtgcgg 


9240 


gggcagctcg 


cccaggtggc 


gccggctgta 


ctgctccatg 


gtggcaggct 


cgtacagccc 


9300 


ggcgatgggc 


tggtaggggt 


tcacagaggc 


caggatggag 


ccgatgtagg 


tccatatttg 


9360 


atttctctta 


taccgctgga 


ataagttata 


catgatggag 


ccgccatgga 


gctctgtcaa 


9420 


ggacgccatg 


tcatccacgc 


cctcctcgtt 


cgtggggtgc 


atagcagtca 


ccttctggtg 


9480 


ggtaattgtg 


ctctgcttgt 


aagtgaatac 


ctgaccatag 


tctgtccgga 


agacgacgat 


9540 


gccttctgca 


caggaattta 


cagtacttgg 


aaaatgctgg 


ccattttctc 


tcagccagac 


9600 


ccgtgttccc 


tgtaaacaaa 


a 








9621 



Table 12 hMX2 polypeptide sequence (SEQ ID NO : 1 2) 



Phe 
1 


Cys 


Leu 


Gin 


Gly 
5 


Thr 


Arg 


Val 


Trp 


Leu 
10 


Arg 


Glu 


Asn 


Gly 


Gin 
15 


His 


Phe 


Pro 


Ser 


Thr 
20 


Val 


Asn 


Ser 


Cys 


Ala 
25 


Glu 


Gly 


He 


Val 


Val 
30 


Phe 


Arg 


Thr 


Asp 


Tyr 
35 


Gly 


Gin 


Val 


Phe 


Thr 
40 


Tyr 


Lys 


Gin 


Ser 


Thr 
45 


He 


Thr 


His 


Gin 


Lys 
50 


Val 


Thr 


Ala 


Met 


His 
55 


Pro 


Thr 


Asn 


Glu 


Glu 
60 


Gly 


Val 


Asp 


Asp 


Met 
65 


Ala 


Ser 


Leu 


Thr 


Glu 
70 


Leu 


His 


Gly 


Gly 


Ser 
75 


He 


Met 


Tyr 


Asn 


Leu 
80 


Phe 


Gin 


Arg 


Tyr 


Lys 
85 


Arg 


Asn 


Gin 


He 


Trp 
90 


Thr 


Tyr 


He 


Gly 


Ser 
95 


He 


Leu 


Ala 


Ser 


Val 


Asn 


Pro 


Tyr 


Gin 


Pro 


He 


Ala 


Gly 


Leu 


Tyr 


Glu 


Pro 
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100 










105 










110 










115 










120 






Leu 




Glu 
125 


Leu 


Pro 


Pro 




130 










135 








Arg 


140 






Lys 


Arg 


145 








Cys 


He 
150 




He 


Lys 


Gly 


Glu 
155 


Ser 


Gly 


Ala 


Gly Lys 
160 


Thr 


Glu 


Ser 


Thr 


Lys 
165 


Leu 


He 


Leu 


Lys 


Phe 
170 




Ser 


Val 


He 


Ser 


Gin 


Gin 


Ser 


Leu 


Glu 
180 


Leu 


Ser 


Leu 


Lys 


Glu 
185 


Lys 


Thr 


Ser 


Cys 


Val 
190 


Glu 


Arg 


Ala 


He 


Leu 
195 


Glu 


Ser 


Ser 


Pro 


He 
200 


Met 


Glu 


Ala 


Phe 


Gly 
205 


Asn 


Ala 


Lys 


Thr 


Val 
210 


Tyr 


Asn 


Asn 


Asn 


Ser 
215 


Ser 


Arg 


Phe 


Gly 


Lys 
220 


Phe 


Val 


Gin 


Leu 


Asn 
225 


He 


Cys 


Gin 


Lys 


Gly 
230 


Asn 


He 


Gin 


Gly 


Gly 
235 


Arg 


He 


Val 


Asp 


Cys 
240 


lie 


Leu 


Ser 


Ser 


Gin 
245 


Asn 


Arg 


Val 


Val 


Arg 
250 


Gin 


Asn 


Pro 


Gly 


Glu 
255 


Arg 


Asn 


Tyr 


His 


He 
260 


Phe 


Tyr 


Ala 


Leu 


Leu 
265 


Ala 


Gly 


Leu 


Glu 


His 
270 


Glu 


Glu 


Arg 


Glu 


Glu 
275 


Phe 


Tyr 


Leu 


Ser 


Thr 
280 


Pro 


Glu 


Asn 


Tyr 


His 
285 


Tyr 


Leu 


Asn 


Gin 


Ser 
290 


Gly 


Cys 


Val 


Glu 


Asp 
295 


Lys 


Thr 


He 


Ser 


Asp 
300 


Gin 


Glu 


Ser 


Phe 


Arg 
305 


Glu 


Val 


He 


Thr 


Ala 
310 


Met 


Asp 


Val 


Met 


Gin 
315 


Phe 


Ser 


Lys 


Glu 


Glu 
320 


Val 


Arg 


Glu 


Val 


Ser 
325 


Arg 


Leu 


Leu 


Ala 


Gly 
330 


He 


Leu 


His 


Leu 


Gly Asn 
335 




Glu 


Phe 


He 
340 


Thr 


Ala 


Gly 


Gly 


Ala 
345 


Gin 


Val 


Ser 


Phe 


Lys 
350 


Thr 


Ala 






355 






Glu 


Leu 


Leu 
360 


Gly 


Leu 


Asp 


Pro 


Thr 
365 


Gin 


Leu 


Thr 


Asp 


Ala 
370 


Leu 


Thr 


Gin 


Arg 


Ser 
375 


Met 


Phe 


Leu 


Arg 


Gly 
380 


Glu 


Glu 


He 


Leu 


385 


Pro 


Leu 


Asn 


Val 


390 


Gin 


Ala 


Val 


Asp 


Ser 
395 


Arg 


Asp 


Ser 


Leu 


Ala 
400 


Met 


Ala 


Leu 


Tyr 


Ala 
405 


Cys 


Cys 


Phe 


Glu 


Trp 
410 


Val 


He 


Lys 


Lys 


He 
415 


Asn 


Ser 


Arg 


He 


Lys 
420 


Gly 


Asn 


Glu 


Asp 


Phe 
425 


Lys 


Ser 


He 


Gly 


He 
430 


Leu 


Asp 


He 


Phe 


Gly 
435 


Phe 


Glu 


Asn 


Phe 


Glu 
440 


Val 


Asn 


His 


Phe 


Glu 
445 


Gin 


Phe 


Asn 


He 


Asn 


Tyr 


Ala 


Asn 


Glu 


Lys 


Leu 


Gin 


Glu 


Tyr 


Phe 


Asn 


Lys 


His 


He 
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450 










455 










460 










465 










470 








Arg 


475 










480 


Asp 


lie 


Asp 


Trp 


He 
485 


Asp 


Asn 


Gly 


Glu 


Cys 
490 


Leu 


Asp 


Leu 


He 


Glu 
495 


Lys 


Lys 


Leu 


Gly 


Leu 
500 


Leu 


Ala 


Leu 


He 


Asn 
505 


Glu 


Glu 


Ser 


His 


Phe 
510 


Pro 


Gin 


Ala 


Thr 


Asp 
515 


Ser 


Thr 


Leu 


Leu 


Glu 
520 


Lys 


Leu 


His 


Ser 


Gin 
525 


His 


Ala 


Asn 


Asn 


His 
530 


Phe 


Tyr 


Val 


Lys 


Pro 
535 


Arg 


Val 


Ala 


Val 


Asn 
540 


Asn 


Phe 


Gly Val 


Lys 
545 


His 


Tyr 


Ala 


Gly 


Glu 
550 


Val 


Gin 


Tyr 


Asp 


Val 
555 


Arg 


Gly 


He 


Leu 


Glu 


Lys 


Asn 


Arg 


Asp 


Thr 
565 


Phe 


Arg 


Asp 


Asp 


Leu 
570 


Leu 


Asn 


Leu 


Leu 


Arg 


Glu 


Ser 


Arg 


Phe 


Asp 
580 


Phe 


He 


Tyr 


Asp 


Leu 
585 


Phe 


Glu 


His 


Val 


Ser 
590 


Ser 


Arg 


Asn 


Asn 


Gin 
595 


Asp 


Thr 


Leu 


Lys 


Cys 
600 


Gly 


Ser 


Lys 


His 


Arg 
605 


Arg 


Pro 


Thr 


Val 


Ser 
610 


Ser 


Gin 


Phe 


Lys 


Val 
615 


Asp 


Ser 


Leu 


His 


Ser 
620 


Leu 


Met 


Ala 


Thr 


Leu 
625 


Ser 


Ser 


Ser 


Asn 


Pro 
630 


Phe 


Phe 


Val 


Arg 


Cys 
635 


He 


Lys 


Pro 


Asn 


Met 
640 


Gin 


Lys 


Met 


Pro 


Asp 
645 


Gin 


Phe 


Asp 


Gin 


Ala 
650 


Val 


Val 






Gin 
655 




Arg 


Tyr 


Ser 


Gly 
660 


Met 


Leu 


Glu 


Thr 


Val 
665 


Arg 


He 


Arg 


Lys 


Ala 
670 


Gly 


Tyr 


Ala 


Val 


Arg 
675 


Arg 


Pro 


Phe 


Gin 


Asp 
680 


Phe 


Tyr 


Lys 


Arg 


Tyr 
685 


Lys 


Val 


Leu 


Met 


Arg 
690 


Asn 


Leu 


Ala 


Leu 


Pro 
695 


Glu 


Asp 


Val 


Arg 


Gly 
700 


Lys 


Cys 


Thr 


Ser 


Leu 
705 


Leu 


Gin 


Leu 


Tyr 


Asp 
710 


Ala 


Ser 


Asn 


Ser 


Glu 
715 


Trp 


Gin 


Leu 


Gly 


720 


Thr 


Lys 


Val 


Phe 


Leu 
725 


Arg 


Glu 


Ser 


Leu 


Glu 
730 


Gin 


Lys 


Leu 


Glu 


Lys 
735 


Arg 


Arg 


Glu 


Glu 


Glu 
740 


Val 


Ser 


His 


Ala 


Ala 
745 


Met 


Val 


He 


Arg 


Ala 
750 


His 


Val 


Leu 


Gly 


Phe 
755 


Leu 


Ala 


Arg 




Gin 
760 


Tyr 


Arg 


Lys 


Val 


Leu 
765 


Tyr 


Cys 


Val 


Val 


lie 
770 


He 


Gin 


Lys 


Asn 


Tyr 
775 


Arg 


Ala 


Phe 


Leu 


Leu 
780 


Arg 


Arg 


Arg 


Phe 


Leu 
785 


His 


Leu 


Lys 


Lys 


Ala 
790 


Ala 


He 


Val 


Phe 


Gin 
795 


Lys 


Gin 


Leu 


Arg 


Gly 
800 


Gin 


He 


Ala 






Val 


Tyr 


Arg 


Gin 


Leu 


Leu 


Ala 


Glu 


Lys 


Arg 


Glu 
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Gin' Glu Glu Lys Lys Lys Gin Glu Glu Glu Glu Lys Lys Lys Arg Glu 
820 825 830 

Glu Glu Glu Arg Glu Arg Glu Arg Glu Arg Arg Glu Ala Glu Leu Arg 
835 840 845 

Ala Gin Gin Glu Glu Glu Thr Arg Lys Gin Gin Glu Leu Glu Ala Leu 
850 855 860 

Gin Lys Ser Gin Lys Glu Ala Glu Leu Thr Arg Glu Leu. Glu Lys Gin 
865 870 875 880 

Lys Glu Asn Lys Gin Val Glu Glu lie Leu Arg Leu Glu Lys Glu lie 
885 890 895 



Ala Ser Leu Gin Lys Leu Gin Glu Arg Arg Asp Gin Glu Leu Arg Arg 
915 920 925 

Leu Glu Glu Glu Ala Cys Arg Ala Ala Gin Glu Phe Leu Glu Ser Leu 
930 935 940 

Asn Phe Asp Glu lie Asp Glu Cys Val Arg Asn lie Glu Arg Ser Leu 
945 950 955 960 

Ser Gly Gly Ser Glu Phe Ser Ser Glu Leu Ala Glu Ser Ala Cys Glu 
965 970 975 

Glu Lys Pro Asn Phe Asn Phe Ser Gin Pro Tyr Pro Glu Glu Glu Val 
980 985 990 

Asp Glu Gly Phe Glu Ala Asp Asp Asp Ala Phe Lys Asp Ser Pro Asn 
995 1000 1005 

Pro Ser Glu His Gly His Ser Asp Gin Arg Thr Ser Gly lie Arg 
1010 1015 1020 

Thr Ser Asp Asp Ser Ser Glu Glu Asp Pro Tyr Met Asn Asp Thr 
1025 1030 1035 

Val Val Pro Thr Ser Pro Ser Ala Asp Ser Thr Val Leu Leu Ala 
1040 1045 1050 

Pro Ser Val Gin Asp Ser Gly Ser Leu His Asn Ser Ser Ser Gly 
1055 1060 1065 

Glu Ser Thr Tyr Cys Met Pro Gin Asn Ala Gly Asp Leu Pro Ser 
1070 1075 1080 



Ala lie Thr Ser Gly Ser Ser Val Thr Phe Ser Asn Ser Tyr Gly 

1100 1105 1110 

Ser Gin Trp Ser Pro . 
1115 

Ser Ser Gly Ala Tyr Arg Phe Ser Ser Glu Gly Ala Gin Ser Ser 

1130 1135 1140 

Phe Glu Asp Ser Glu Glu Asp Phe Asp Ser Arg Ph e Asp Thr Asp 
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1145 










1150 










1155 










1160 










1165 










1170 










1175 




Phe 


His 


Ser 


1180 










1185 








Asn 


Ser 
1190 


Trp 


Lys 


Arg 


Arg 


Trp 
1195 


Cys 


Val 


Leu 


Lys 


Asp 
1200 


Glu 


Thr 


Phe 




1205 


Phe 


Arg 


Ser 


Lys 


1210 










1215 








His 


Lys 
1220 


Lys 


Gly Gly 


Gly 


Ser 
1225 


Ser 


Thr 


Leu 


Ser 


Arg 
1230 


Arg 


Asn 


Trp 


Lys 


Lys 
1235 


Arg 


Trp 


Phe 


Val 


Leu 
1240 


Arg 


Gin 


Ser 


Lys 


Leu 
1245 


Met 


Tyr 


Phe 


Glu 


Asn 
1250 


Asp 


Ser 


Glu 


Glu 


Lys 
1255 


Leu 


Lys 


Gly 


Thr 


Val 
1260 


Glu 


Val 


Arg 


Thr 


Ala 
1265 


Lys 


Glu 


He 


He 


Asp 
1270 


Asn 


Thr 


Thr 


Lys 


Glu 
1275 


Asn 


Gly 


He 


Asp 


lie 
1280 


He 


Met 


Ala 


Asp 


Arg 
1285 


Thr 


Phe 


His 


Leu 


He 
1290 


Ala 


Glu 


Ser 


Pro 


Glu 
1295 


Asp 


Ala 


Ser 


Gin 


Trp 
1300 


Phe 


Ser 


Val 


Leu 


Ser 
1305 


Gin 


Val 


His 


Ala 


Ser 
1310 


Thr 


Asp 


Gin 


Glu 


He 
1315 


Gin 


Glu 


Met 


His 


Asp 
1320 


Glu 


Gin 


Ala 


Asn 


Pro 
1325 


Gin 


Asn 


Ala 


Val 


Gly 
1330 


Thr 


Leu 


Asp 


Val 


Gly 
1335 


Leu 


He 


Asp 


Ser 


Val 
1340 


Cys 


Ala 


Ser 


Asp 


Ser 
1345 


Pro 


Asp 


Arg 


Pro 


Asn 
1350 


Ser 


Phe 


Val 


lie 


lie 
1355 


Thr 


Ala 


Asn 


Arg 


Val 
1360 




His 


Cys 


Asn 


Ala 
1365 


Asp 


Thr 


Pro 


Glu 


Glu 
1370 


Met 


His 


His 


Trp 


He 
1375 


Thr 


Leu 


Leu 


Gin 


Arg 
1380 


Ser 


Lys 


Gly 


Asp 


Thr 
1385 


Arg 


Val 


Glu 


Gly 


Gin 
1390 


Glu 


Phe 


He 


Val 


Arg 
1395 


Gly 


Trp 




His 


Lys 
1400 


Glu 


Val 


Lys 


Asn 


Ser 

1405 


Pro 


Lys 


Met 


Ser 


Ser 
1410 


Leu 


Lys 


Leu 


Lys 


Lys 
1415 


Arg 


Trp 


Phe 


Val 


Leu 
1420 


Thr 


His 


Asn 


Ser 


Leu 
1425 


Asp 


Tyr 


Tyr 


Lys 


Ser 
1430 


Ser 


Glu 


Lys 


Asn 


Ala 
1435 


Leu 


Lys 


Leu 


Gly 


Thr 
1440 


Leu 


Val 




Asn 


Ser 
1445 


Leu 


Cys 


Ser 


Val 


Val 
1450 


Pro 


Pro 


Asp 


Glu 


Lys 
1455 


He 


Phe 


Lys 


Glu 


Thr 
1460 


Gly 


Tyr 


Trp 


Asn 


Val 
1465 


Thr 


Val 


Tyr 


Gly 


1470 


Lys 


His 


Cys 


Tyr 


Arg 


Leu 


Tyr 


Thr 




Leu 






Glu 


Ala 


Thr 


Arg 


Trp 


Ser 
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1475 










1480 










1485 










1490 










1495 










1500 


lie 


Asp 


Thr 


Pro 


Thr 
1505 


Gin 


Gin 


Leu 


He 


Gin 
1510 


Asp 


He 


Lys 


Glu 


Asn 
1515 


Cys 


Leu 


Asn 


Ser 


Asp 
1520 


Val 


Val 


Glu 


Gin 


He 
1525 


Tyr 


Lys 






Pro 
1530 


He 


Leu 


Arg 


Tyr 


Thr 
1535 


His 


His 


Pro 


Leu 


His 
1540 


Ser 


Pro 


Leu 


Leu 


Pro 
1545 


Leu 


Pro 


Tyr 


Gly Asp 
1550 


He 




Leu 




Leu 
1555 


Leu 


Lys 


Asp 


Lys 


Gly 
1560 


Tyr 


Thr 


Thr 


Leu 


Gin 




Glu 


Ala 


He 


Lys 
1570 


He 


Phe 


Asn 


Ser 


Leu 
1575 


Gin 


Gin 


Leu 


Glu 


Ser 


Met 


Ser 


Asp 


Pro 


He 
1585 


Pro 


He 


He 


Gin 


Gly 
1590 


He 


Leu 


Gin 


Thr 


Gly 


His 


Asp 


Leu 


Arg 


Pro 
1600 


Leu 


Arg 


Asp 


Glu 


160 5 


Tyr 


Cys 


Gin 


Leu 


lie 


Lys 


Gin 


Thr 


Asn 


Lys 
1615 


Val 


Pro 


His 


Pro 


Gly 
1620 


Ser 


Val 


Gly 


Asn 


Leu 


Tyr 


Ser 


Trp 


Gin 


He 
1630 


Leu 


Thr 


Cys 


Leu 


Ser 
1635 


Cys 


Thr 


Phe 


Leu 


Pro 


Ser 


Arg 


Gly 


He 


Leu 
1645 


Lys 


Tyr 


Leu 


Lys 


Phe 
1650 


His 


Leu 


Lys 


Arg 


He 




Glu 


Gin 


Phe 


Pro 
1660 


Gly 


Thr 


Glu 


Met 


Glu 
1665 


Lys 


Tyr 


Ala 


Leu 


Phe 
1670 


Thr 


Tyr 


Glu 


Ser 


Leu 
1675 


Lys 


Lys 


Thr 


Lys 


Cys 
1680 


Arg 


Glu 


Phe 


Val 


Pro 
1685 


Ser 


Arg 


Asp 


Glu 


He 
1690 


Glu 


Ala 


Leu 


lie 


His 
1695 


Arg 


Gin 


Glu 


Met 


Thr 
1700 


Ser 


Thr 


Val 


Tyr 


Cys 
1705 


His 


Gly 


Gly 


Gly 


Ser 
1710 


Cys 


Lys 


He 


Thr 


He 
1715 


Asn 


Ser 


His 


Thr 


Thr 
1720 


Ala 


Gly Glu 


Val 


Val 
1725 


Glu 


Lys 


Leu 


lie 


Arg 
1730 


Gly 


Leu 


Ala 


Met 


Glu 
1735 


Asp 


Ser 


Arg 


Asn 


Met 
1740 


Phe 


Ala 


Leu 


Phe 


Glu 
1745 


Tyr 


Asn Gly His 


Val 
1750 


Asp 


Lys 


Ala 


He 


Glu 
1755 


Ser 


Arg 


Thr 


Val 


Val 
1760 


Ala 


Asp 


Val 


Leu 


Ala 
1765 


Lys 


Phe 


Glu 


Lys 


Leu 
1770 


Ala 


Ala 


Thr 


Ser 


Glu 
1775 


Val 


Gly Asp 


Leu 


Pro 
1780 


Trp 


Lys 


Phe 


Tyr 


Phe 
1785 


Lys 


Leu 


Tyr 


Cys 


Phe 
1790 


Leu 


Asp 


Thr 


Asp 


Asn 
1795 


Val 


Pro 


Lys 


Asp 


Ser 
1800 


Val 


Glu 


Phe 


Ala 


Phe 


Met 


Phe 


Glu 


Gin 


Ala 


His 


Glu 


Ala 


Val 


He 


His 


Gly 


His 
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1805 










1810 










1815 








His 


Pro 
1820 


Ala 


Pro 


Glu 


Glu 


Asn 
1825 


Leu 


Gin 


Val 


Leu 


Ala 
1830 


Ala 


Leu Arg 


Leu 


Gin 
1835 


Tyr 


Leu 


Gin 


Gly 


Asp 
1840 


Tyr 


Thr 


Leu 


His 


Ala 
1845 


Ala 


He 


Pro 


Pro 


Leu 
1850 


Glu 


Glu 


Val 


Tyr 


Ser 
1855 


Leu 


Gin 


Arg 


Leu 


Lys 
1860 


Ala 


Arg 


He 


Ser 


Gin 
1865 


Ser 


Thr 


Lys 


Thr 


Phe 
1870 


Thr 


Pro 


Cys 


Glu 


Arg 
1875 


Leu 


Glu 


Lys 


Arg 


Arg 
1880 


Thr 


Ser 


Phe 


Leu 


Glu 
1885 


Gly 


Thr 


Leu 


Arg 


Arg 
1890 


Ser 


Phe 


Arg 


Thr 


Gly 
1895 


Ser 


Val 


Val 


Arg 


Gin 
1900 


Lys 


Val 


Glu 


Glu 


Glu 
1905 


Gin 


Met 


Leu 


Asp 


Met 
1910 


Trp 


He 


Lys 


Glu 


Glu 
1915 


Val 


Ser 


Ser 


Ala 


Arg 
1920 


Ala 


Ser 


He 


He 


Asp 
1925 


Lys 


Trp 


Arg 


Lys 


Phe 
1930 


Gin 


Gly 


Met 


Asn 


Gin 
1935 


Glu 


Gin 


Ala 


Met 


Ala 
1940 


Lys 


Tyr 


Met 


Ala 


Leu 
1945 


He 


Lys 


Glu 


Trp 


Pro 
1950 


Gly 


Tyr 


Gly 


Ser 


Thr 
1955 


Leu 


Phe 


Asp 


Val 


Glu 
1960 


Val 


Arg 


Thr 


Gly 


Cys 
1965 


His 


Val 


Leu 


Gly 


Trp 
1970 


Ala 


Gly 


Cys 


Trp 


His 
1975 


Leu 


Arg 


Thr 


Trp 


lie 
1980 


Thr 


Ala 


Lys 


Phe 


Met 
1985 


Trp 


Arg 


Glu 


Asp 


Lys 
1990 


Met 


Glu 


His 


Phe 


Ala 

1995 


Leu 


Ser 


Thr 


Ser 


Phe 
2000 


Phe 


Arg 


Ala 


Pro 


Lys 
2005 


He 


Val 


Pro 


Leu 


Thr 
2010 


Pro 


Pro 


Phe 


Ser 


Ser 
2015 


Gin 


Phe 


Leu 


Phe 


Ser 
2020 


Cys 


Val 


Val 


Asn 


Ala 
2025 


Ser 


Val 


He 


Leu 


Gly 
2030 


Met 


Asn 


Ala 


Lys 


Leu 
2035 


Arg 


Cys 


His 


Leu 


Phe 
2040 


Phe 


Tyr 


Pro 


Ser 


2045 


Gly 


Lys 


Leu 























Table 13 hMP nucleotide sequence (SEQ ID NO: 13) 



ccaacttttg 


cagctccacc 


caggatgtgg 


cctcgctcca 


ccccagctgt 


gcgcctctct 


60 


ccacccctag 


gcgaaggcac 


tagaatttcc 


caaattaaga 


acgaagagga 


agtttggacc 


120 


ttttcggcca 


ccgctcgctt 


caatatggct 


gcccccaggg 


agagacgagg 


ctaccatgaa 


180 


ggagccgagc 


gcagaccctg 


agtccgtcac 


ccatggatcg 


cagcgcggag 


ttcaggaaat 


240 


ggaaggcgca 


atgtttgagc 


aaagcggacc 


tcagccggaa 


gggcagtgtt 


gacgaggatg 


300 


tggtagagct 


tgtgcagttt 


ctgaacatgc 


gagatcagtt 


tttcaccacc 


agctccttcg 


360 


ctggccgcat 


cctactcctt 


gaccggggta 


taaatggttt 


tgaggttcag 


aaacaaaact 


420 


gttgctggct 


actggttaca 


cacaaacttt 


gtgtaaaaga 


tgatgtgatt 


gtagctctga 


480 
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agaaagcaaa 


tggtgatgcc 


actttgaaat 


fctgaaccatt 


tgttcttcat 


gtgcagtgtc 


540 


gacaattgca 


ggatgcacag 


attctgcatt 


ccatggcaat 


agattctggt 


ttcaggaact 


600 


ctggcataac 


ggtgggaaag 


agaggaaaaa 


ctatgttggc 


tgtccggagt 


acacatggct 


660 


tagaagttcc 


attaagccat 


aagggaaaac 


tgatggtgac 


agaggaatat 


attgacttcc 


720 


tgttaaatgt 


ggcaaatcaa 


aaaatggagg 


aaaacaagaa 


aagaattgag 


aggttttaca 


780 


actgcctaca 


gcatgctttg 


gaaagggaaa 


cgatgactaa 


cttacatccc 


aagatcaaag 


840 


agaaaaataa 


ctcatcatat 


attcataaga 


aaaaaagaaa 


cccagaaaaa 


acacgtgccc 


900 


agtgtattac 


taaagaaagt 


gatgaagaac 


ttgaaaatga 


tgatgatgat 


gatctaggaa 


960 


tcaatgttac 


catcttccct 


gaagattact 


aagctttggt 


tctgatgtgt 


cttggccgta 


1020 


atgtttctag 


taggttttat 


aaagctgctc 


ttcataagag 


tattttagtt 


tgttgagtgt 


1080 


atcagccatt 


cataagccag 


taatgacaag 


tgcagagctt 


caaactataa 


ctttgttgcc 


1140 


cagaggatgt 


gcagttgtca 


tctaagctct 


cagcagtacc 


cggcttatcc 


tacgacttca 


1200 


cctgaaatgc 


tatagttatc 


cctacttttt 


taccagtttc 


tcccagaagc 


acctgcttaa 


1260 


taaatcaaag 


atgtttgaaa 
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Table 14 hMP polypeptide sequence (SEQ ID NO: 1 4) 



Met 
1 


Asp 


Arg 


Ser 


Ala 
5 


Glu 


Phe 


Arg 


Lys 


Trp 
10 


Lys 


Ala 


Gin 


Cys 


Leu 
15 


Ser 




Ala 




20 










25 










30 








Val 


Gin 
35 


Phe 


Leu 


Asn 


Met 


Arg 
40 


Asp 


Gin 


Phe 


Phe 


Thr 
45 


Thr 


Ser 


Ser 


Phe 


Ala 
50 


Gly 


Arg 


He 


Leu 


Leu 
55 


Leu 


Asp 


Arg 


Gly 


He 
60 


Asn 


Gly 


Phe 


Glu 


Val 
65 


Gin 


Lys 


Gin 


Asn 


Cys 
70 


Cys 


Trp 




Leu 


Val 
75 


Thr 


His 


Lys 


Leu 


Cys 
80 


Val 


Lys 


Asp 


Asp 


Val 
85 


He 


Val 


Ala 


Leu 


Lys 
90 


Lys 


Ala 


Asn 


Gly 


Asp 
95 


Ala 


Thr 


Leu 


Lys 


Phe 
100 


Glu 


Pro 


Phe 


Val 


Leu 
105 


His 


Val 


Gin 


Cys 


Arg 
110 


Gin 


Leu 


Gin 


Asp 


Ala 
115 


Gin 


He 


Leu 


His 


Ser 
120 


Met 


Ala 


He 


Asp 


Ser 
125 


Gly 


Phe 


Arg 


Asn 


Ser 
130 


Gly 


lie 


Thr 


Val 


Gly 
135 


Lys 


Arg 


Gly 


Lys 


Thr 
140 


Met 


Leu 


Ala 


Val 


Arg 
145 


Ser 


Thr 


His 


Gly 


Leu 
150 


Glu 


Val 


Pro 


Leu 


Ser 
155 


His 


Lys 


Gly 


Lys 


Leu 
160 


Met 


Val 


Thr 


Glu 


Glu 
165 


Tyr 


He 


Asp 


Phe 


Leu 
170 


Leu 


Asn 


Val 


Ala 


Asn 
175 


Gin 


Lys 


Met 


Glu 


Glu 
180 


Asn 


Lys 


Lys 


Arg 


He 
185 


Glu 


Arg 


Phe 


Tyr 


Asn 
190 


Cys 


Leu 


Gin 


His 


Ala 
195 


Leu 


Glu 


Arg 


Glu 


Thr 
200 


Met 


Thr 


Asn 


Leu 


His 
205 


Pro 




He 


Lys 


Glu 


Lys 


Asn 


Asn 


Ser 


Ser 


Tyr 


He 


His 




Lys 


Lys 




Asn 


Pro 
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210 








215 








220 










Glu 
225 


Lys 


Thr 


Arg 


Ala Gin 
230 


Cys 


He Thr 


Lys 


Glu 
235 


Ser 


Asp 


Glu 


Glu 


Leu 
240 


Glu 


Asn 


Asp 


Asp 


Asp Asp 
245 


Asp 


Leu Gly 


He 
250 


Asn 


Val 


Thr 


He 


Phe 
255 


Pro 


Glu 


Asp 


Tyr 

























Table 15 NHR nucleotide sequence (SEQ ID NO: 1 5) 



acgcgtgcag 


gtggcgtggc 


gccagggatt 


tgaaccgcgc 


tgacgaagtt 


tggtgatcca 


60 


tcttccgagt 


atcgccggga 


tttcgaatcg 


cgatgatcat 


cccctctcta 


gaggagctgg 


120 


actccctcaa 


gtacagtgac 


ctgcagaact 


tagccaagag 


tctgggtctc 


cgggccaacc 


180 


tgagggcaac 


caagttgtta 


aaagccttga 


aaggctacat 


taaacatgag 


gcaagaaaag 


240 


gaaatgagaa 


tcaggatgaa 


agtcaaactt 


ctgcatcctc 


ttgtgatgag 


actgagatac 


300 


agatcagcaa 


ccaggaagag 


ctgagagaca 


gccacttggc 


catgtcacca 


aaacaaggag 


360 


aaggtgcaag 


actgtccgtg 


tggaccctga 


ctcacagaga 


atcattcaga 


gataaaaata 


420 


agtaatccca 


ctgaattcca 


gaatcatgaa 


aagcaggaaa 


gccaggatct 


cagagcactg 


480 


caaaagttcc 


ttctccacca 


gacgagcacc 


aagaagctga 


gaatgctgtt 


tcctcaggta 


540 


acagagattc 


aaaggtacct 


tcagaaggaa 


agaaatctct 


ctacacagat 


gagtcatcca 


600 


aacctggaaa 


aaataaaaga 


actgcaatca 


ctactccaaa 


ctttaagaag 


cttcatgaag 


660 


ctcattttaa 


ggaaatggag 


tccattgatc 


caatatatng 


aggagaaaaa 


aagaaacatt 


720 


ttgaagaaca 


caattccatg 


aatgaactga 


agcagccgcc 


catcaataag 


ggaggggtca 


780 


ggactccagt 


acctccaaga 


ggaagactct 


ctgtggcttc 


tactcccatc 


agccaacgac 


840 


gctcgcaagg 


ccggtcttgt 


ggccctgcaa 


gtcagagtac 


cttgggtctg 


aaggggtcac 


900 


tcaagcgctc 


tgctatctct 


gcagctaaaa 


cgggtgtcag 


gttttcagct 


gctactaaag 


960 


ataatgagca 


taagcgttca 


ctgaccaaga 


ctccagccag 


aaagtctgca 


catgtgaccg 


1020 


tgtctggggg 


cacccaaaaa 


ggcgaggctg 


tgcttgggac 


acacaaatta 


aagaccatca 


1080 


cggggaattc 


tgctgctgtt 


attaccccat 


tcaagttgac 


aactgaggca 


acgcagactc 


1140 


cagtctccaa 


taagaaacca 


gtgtttgatc 


ttaaagcaag 


tttgtctcgt 


cccctcaact 


1200 


atgaaccaca 


caaaggaaag 


ctaaaaccat 


gggggcaatc 


taaagaaaat 


aattatctaa 


1260 


atcaacatgt 


caacaaatta 


acttctacaa 


gaaaacttac 


aaacaacccc 


atctccagac 


1320 


aaaggaagag 


caacggaaga 


aacgcgagca 


agaagaaagg 


agaagaaagc 


aaaggttttg 


1380 


ggaatgcgaa 


ggggcctcat 


tttggctgaa 


gattaataat 


tttttaacat 


cttgtaaata 


1440 


ttcctgtatt 


ctcaactttt 


ttccttttgt 


aaattttttt 


tttttgctgt 


catccccact 


1500 


ttagtcacga 


gatctttttc 


tgctaactgt 


tcatagtctg 


tgtagtgtcc 


atgggttctt 


1560 


catgtgctat 


gatctctgaa 


aagacgttat 


caccttaaag 


ctcaaattct 


ttgggatggt 


1620 


ttttacttaa 


gtccattaac 


aattcaggtt 


tctaacgaga 


cccatcctaa 


aattctcttt 


1680 


ctagtttttt 


aatgtcacca 


tcccaaactc 


ccgtttctgg 


atttttaatc 


cccagctccc 


1740 


cagttccctc 


ttatcgtact 


aatattaaca 


gaactgcagt 


cttctgctag 


ccaatagcat 


1800 


ttacctgatg 


gcagctagtt 


atgcaagctt 


caggagaatt 


tgaacaataa 


caagaatagg 


1860 


gtaagctggg 


atagaaaggc 


cacctcttca 


ctctctatag 


aatatagtaa 


cctttatgaa 


1920 


acggggccat 


atagtttggt 


tatgacatca 


atattttacc 


taggtgaaat 


tgtttaggct 


1980 


tatgtacctt 


cgttcaaata 


tcctcatgta 


attgccatct 


gtcactcact 


atattcacaa 


2040 
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aaataaaact 


ctacaactca 


ttctaacatt 


gcttacttaa 


aagctacata 


gccctatcga 


2100 


aatgcgagga 


ttaatgcttt 


aatgctttta 


gagacagggt 


ctcactgtgt 


tgcccaggct 


2160 


ggtctcaaac 


tccaccaaat 


gtacttctta 


ttcattttat 


ggaaaagact 


aggctttgct 


2220 


tagtatcatg 


tccatgtttc 


cttcacctca 


gtggagcttc 


tgagttttat 


actgctcaag 


2280 


atcgtcataa 


ataaaatttt 


ttctcattgt 


caaaaaaaaa 


aaaaaaaaaa 


aaaaaaaaaa 


2340 


aaaaaaaaaa 


aa 










2352 



Table 16 NHR polypeptide sequence (SEQ ID NO: 16) 



Met 


He 


He 


Pro 


Ser 


Leu 


Glu 


Glu 


Leu 


Asp 


Ser 


Leu 


Lys 


Tyr 


Ser 


Asp 


Leu 


Gin 


Asn 


Leu 


Ala 


Lys 


Ser 


Leu 


Gly 


Leu 


Arg 


Ala 


Asn 


Leu 


Arg 


Ala 


Thr 


Lys 


Leu 


Leu 


Lys 


Ala 


Leu 


Lys 


Gly 


Tyr 


He 


Lys 


His 


Glu 


Ala 


Arg 


Lys 


Gly 


Asn 


Glu 


Asn 


Gin 


Asp 


Glu 


Ser 


Gin 


Thr 


Ser 


Ala 


Ser 


Ser 


Cys 


Asp 


Glu 


Thr 


Glu 


He 


Gin 


He 


Ser 


Asn 


Gin 


Glu 


Glu 


Ala 


Glu 


Arg 


Gin 


Pro 


Leu 


Gly 


His 


Val 


Thr 


Lys 


Thr 


Arg 


Arg 


Arg 


Cys 


Lys 


Thr 


Val 


Arg 


Val 


Asp 


Pro 


Asp 


Ser 


Gin 


Gin 


Asn 


His 


Ser 


Glu 


He 


Lys 


He 


Ser 


Asn 


Pro 


Thr 


Glu 


Phe 


Gin 


Asn 


His 


Glu 


Lys 


Gin 


Glu 


Ser 


Gin 


Asp 


Leu 


Arg 


Ala 


Thr 
13 0 


Ala 


Lys 


Val 


Pro 


Ser 
135 


Pro 


Pro 


Asp 


Glu 


His 
140 


Gin 


Glu 


Ala 


Glu 


Asn 
145 


Ala 


Val 


Ser 


Ser 


Gly 
150 


Asn 


Arg 


Asp 


Ser 


Lys 
155 


Val 


Pro 


Ser 


Glu 


Gly 
160 


Lys 


Lys 


Ser 


Leu 


Tyr 
165 


Thr 


Asp 


Glu 


Ser 


Ser 
170 


Lys 


Pro 


Gly 


Lys 


Asn 
175 




Arg 


Thr 


Ala 


He 
180 


Thr 


Thr 


Pro 


Asn 


Phe 
185 


Lys 


Lys 


Leu 


His 


Glu 
190 


Ala 


His 


Phe 


Lys 


Glu 
195 


Met 


Glu 


Ser 


He 


Asp 
200 


Pro 


He 


Tyr 


Xaa 


Gly 
205 


Glu 


Lys 


Lys 


Lys 


His 
210 


Phe 


Glu 


Glu 


His 


215 


Ser 


Met 




Glu 


Leu 
220 




Gin 


Pro 


Pro 


He 
225 


Asn 


Lys 


Gly 


Gly 


Val 
230 


Arg 


Thr 


Pro 


Val 


Pro 
235 


Pro 


Arg 


Gly 


Arg 


Leu 
240 


Ser 


Val 


Ala 


Ser 


Thr 
245 


Pro 


He 


Ser 


Gin 


Arg 
250 


Arg 


Ser 


Gin 


Gly 


Arg 
255 


Ser 


Cys 


Gly 


Pro 


Ala 
260 


Ser 


Gin 


Ser 


Thr 


Leu 
265 


Gly 


Leu 


Lys 


Gly 


Ser 
270 


Leu 




Arg 


Ser 


Ala 


He 


Ser 


Ala 


Ala 


Lys 


Thr 


Gly 


Val 


Arg 


Phe 


Ser 


Ala 


Ala 
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275 










280 










285 










290 










295 










300 






Ala 




305 










310 




Ser 






315 






Gly 




Ala 
320 










325 










He 
330 










Ala 
335 


Ala 








340 










345 










350 










355 










3 60 










365 










370 










375 










380 










385 










390 








Val 


395 










400 


Lys 


Lys 


Thr 


Tyr 


Lys 
405 


Gin 


Pro 


His 


Leu 


Gin 
410 


Thr 


Lys 


Glu 


Glu 


Gin 
415 


Arg 


Lys 


Lys 


Arg 


Glu 
420 


Gin 


Glu 


Arg 


Lys 


Glu 
425 


Lys 


Lys 


Ala 


Lys 


Val 
430 


Leu 


Gly 


Met 


Arg 


Arg 
435 


Gly 


Leu 


He 


Leu 


Ala 
440 


Glu 


Asp 















Table 17 displays alignment of hMXl, hMX2 with human myosin (SEQ ID 
NO:31; GenBank AF247457) (Berg et al, 2000). As seen from the alignment, hMXl and 
hMX2 have a likely N-terminus of M N D residues. One of skill in the art can easily 
verify this observation by probing cDNA or genomic human libraries, or PCR techniques, 
to acquire the full length polynucleotide sequence. 



Table 17 Alignment of hMXl , hMX2 and human myosin X 


10 


1 


FC LQGTRVWLRENGQHFPSTVNSCAEG I WFRTDYGQVFTYKQST I T 


12 


1 


FCLQGTRVWLRENGQHFPSTVNSCAEGIWFRTDYGQVFTYKQSTIT 


humX 


1 


MDNFFTEGTRVWLRENGQHFPSTVNSCAEGIWFRTDYGQVFTYKQSTIT 


10 


48 


HQKVTAMHPTNEEGVDDMASLTELHGGSIMYNLFQRYKRNQIWTYIGSIL 


12 


48 


HQKVTAMHPTNEEGVDDMASLTELHGGSIMYNLFQRYKRNQIWTYIGSIL 


humX 


51 


HQKVTAMHPTNEEGVDDMASLTELHGGSIMYNLFQRYKRNQIYTYIGSIL 


10 


98 


ASVNPYQ P I AGL YEP ATMEQY SRRHLGELP PH I FAIANECYRC LWKRHDN 


12 


98 


ASVNPYQPIAGLYEPATMEQYSRRHLGELPPHIFAIANECYRCLWKRHDN 


humX 


101 


ASVNPYQPIAGLYEPATMEQYSRRHLGELPPHIFAIANECYRCLWKRYDN 


10 


148 


QCILIKGESGAGKTESTKLILKFLSVISQQSLELSLKEKTSCVERAILES 


12 


148 


QCILIKGESGAGKTESTKLILKFLSVISQQSLELSLKEKTSCVERAILES 


humX 


151 


QCILISGESGAGKTESTKLILKFLSVISQQSLELSLKEKTSCVERAILES 


10 


198 


SPIMEAFGNAKTVYNNNSSRFGKFVQLNICQKGNIQGGRIVDCILSSQNR 
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12 198 SPIMEAFGNAKTVYNWNSSRFGKFVQLNICQKGNIQGGRIVDCILSSQNR 

humX 2 01 S P IME AF GN AKTVYNNN S S RF GKF VQLN I C QKGNI QGGR IVD YLL E - KNR 

10 248 WRQNPGERNYHIFYALLAGLEHEEREEFYLSTPENYHYLNQSGCVEDKT 

12 248 WRQNPGERNYHIFYALLAGLEHEEREEFYLSTPEWYHYLNQSGCVEDKT 

humX 25 0 WRQNPGERNYHIFYALLAGLEHEEREEFYLSTPENYHYLNQSGCVEDKT 

10 2 98 ISDQESFREVITAMDVMQFSKEEVREVSRLLAGILHLGNIEFITAGGAQV 

12 298 I SDQESFREV I TAMDVMQF SKEEVREVSRLLAGILHLGNI E F I T AGGAQV 

humX 3 00 ISDQESFREVITAMDVMQFSKEEVREVSRLLAGILHLGNIEFITAGGAQV 

10 348 SFKTALGRSAELLGLDPTQLTDALTQRSMFLRGEEILTPLNVQQAVDSRD 

12 348 S FKTALGRS AELLGLD PTQLTDALTQRSMFLRGEE I LT PLNVQQAVDSRD 

humX 350 SFKTALGRSAELLGLDPTQLTDALTQRSMFLRGEEILTPLNVQQAVDSRD 

10 3 98 SLAMALYACCFEWVIKKINSRIKGNEDFKSIGILDIFGFENFEVNHFEQF 

12 398 SLAMALYACCFEWIKKINSRIKGNEDFKSIGILDIFGFENFEVNHFEQF 

humX 400 SLAMALYACCFEWVIKKINSRIKGNEDFKSIGILDIFGFENFEVNHFEQF 

10 448 NINYANEKLQEYFNKHIFSLEQLEYSREGLVWEDIDWIDNGECLDLIEKK 

12 44 8 NINYANEKLQEYFNKHIFSLEQLEYSREGLVWEDIDWIDNGECLDLIEKK 

humX 45 0 NINYAMEKLQEYFNKHIFSLEQLEYSREGLVWEDIDWIDNGECLDLIEKK 

10 498 LGLLALINEESHFPQATDSTLLEKLHSQHANNHFYVKPRVAVNNFGVKHY 

12 49 8 LGLLALINEESHFPQATDSTLLEKLHSQHANNHFYVKPRVAVNNFGVKHY 

humX 50 0 LGLLALINEESHFPQATDSTLLEKLHSQHANNHFYVKPRVAVNNFGVKHY 

10 548 AGEVQYDVRGILEKNRDTFRDDLLNLLRESRFDFIYDLFEHVSSR3MNQDT 

12 548 AGEVQYDVRGILEKNRDTFRDDLLNLLRESRFDFIYDLFEHVSSRNNQDT 

humX 55 0 AGEVQ YDVRGI LEKNRDTFRDDLLNLLRE S RFDF I YDLFEHVSSRNNQDT 

10 598 LKCGSKHRRPTVSSQFKVDSLHSLMATLSSSNPFFVRCIKPNMQFMPDQF 

12 59 8 LKCGSKHRRPTVSSQFKVDSLHSLMATLSSSNPFFVRCIKPNMQKMPDQF 

humX 6 00 LKCGSKHRRPTVSSQFKDS-LHSLMATLSSSNPFFVRCIKPNMQKMPDQF 

10 648 DQAWLNQLRYSGMLETVRIRKAGYAVRRPFQDFYKRYKVLMRNLALPED 

12 648 DQAWLNQLRYSGMLETVRIRKAGYAVRRPFQDFYKRYKVLMRNLALPED 

humX 649 DQAWLNQLRYSGMLETVRIRKAGYAVRRPFQDFYKRYKVLMRNLALPED 

10 698 VRGKCTSLLQLYDASNSEWQLGKTKVFLRESLEQKLEKRREEEVSHAAMV 

12 698 VRGKCTSLLQLYDASNSEWQLGKTKVFLRESLEQKLEKRREEEVSHAAMV 

humX 699 VRGKCTSLLQLYDASNSEWQLGKTKVFLRESLEQKLEKRREEEVSHAAMV 

10 748 IRAHVLGFLARKQYRKVLYCWIIQKNYRAFLLRRRFLHLKKAAIVFQKQ 

12 748 IRAHVLGFLARKQYRKVLYCWIIQKNYRAFLLRRRFLHLKKAAIVFQKQ 

humX 749 IRAHVLGFLARKQYRKVLYCWIIQKNYRAFLLRRRFLHLKKAAIVFQKQ 

10 79 8 LRGQ I ARRVYRQLLAEKREQEEKKKQEEEEKKKREEEERERERERRE AEL 

12 798 LRGQIARRVYRQLLAEKREQEEKKKQEEEEKKKREEEERERERERREAEL 

humX 799 LRGQIARRVYRQLLAEKREQEEKKKQEEEEKKKREEEERERERERREAEL 

10 848 RAQQEEETRKQQELEALQKSQKEAELTRELEKQKENKQVEEILRLEKEIE 

12 848 RAQQEEETRKQQELEALQKSQKEAELTRELEKQKENKQVEEILRLEKEIE 

humX 84 9 RAQQEEETRKQQELEALQKSQKEAELTRELEKQKENKQVEEILRLEKEIE 

10 898 DLQRMKEQQELSLTEASLQKLQERRDQELRRLEEEACRAAQEFLESLNFD 

12 898 DLQRMKEQQELSLTEASLQKLQERRDQELRRLEEEACRAAQEFLESLNFD 

humX 899 DLQRMKEQQELSLTEASLQKLQERRDQELRRLEEEACRAAQEFLESLNFD 

10 948 EIDECVRNIERSLSGGSEFSSELAESACEEKPNFNFSQPYPEEEVDEGFE 

12 94 8 EIDECVRNIERSLSGGSEFSSELAESACEEKPNFNFSQPYPEEEVDEGFE 

humX 949 EIDECVRNIERSLSVGSEFSSELAESACEEKPNFNFSQPYPEEEVDEGFE 

10 998 ADDDAFKDSPNPSEHGHSDQRTSGIRTSDDSSEEDPYMNDTWPTSPSAD 

12 998 ADDDAFKDSPNPSEHGHSDQRTSGIRTSDDSSEEDPYMNDTWPTSPSAD 

humX 999 ADDDAFKDSPMPSEHGHSDQRTSGIRTSDDSSEEDPYMNDTWPTSPSAD 
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iq 1048 STVLLAPSVQDSGSLHNSSSGESTYCMPQNAGDLPSPDGDYDYDQDDYED 

12 1048 STVLLAPSVQDSGSLHNS SSGE S TYCMPQNAGDLPS PDGDYDYDQDDYED 

humX 1049 STVLLAPSVQDSGSLHNSSSGESTYCMPQNAGDLPSPDGDYDYDQDDYED 

10 1098 GAITSGSSVTFSNSYGSQWSPDYRCSVGTYNSSGAYRFSSEGAQSSFEDS 

12 10 98 GAITSGSSVTFSNSYGSQWSPDYRCSVGTYNSSGAYRFSSEGAQSSFEDS 

humX 1099 GAITSGSSVTFSNSYGSQWSPDYRCSVGTYNSSGAYRFSSEGAQSSFEDS 

10 1148 EEDFDSRFDTDDELSYRRDSVYSCVTLPYFHSFLYMKGGLMNSWKRRWCV 

12 1148 EEDFDSRFDTDDELSYRRDSVYSCVTLPYFHSFLYMKGGLMNSWKRRWCV 

humX 1149 EEDFDSRFDTDDELSYRRDSVYSCVTLPYFHSFLYMKGGLMNSWKRRWCV 

10 119 8 LKDETFLWFRSKQEALKQGWLHKKGGGS STLSRRNWKKRWFVLRQSKLMY 

12 1198 LKDETFLWFRS KQE ALKQGWLHKKGGGS S TL S RRNWKKRWF VLRQS KLMY 

humX 1199 LKDETFLWFRSKQEALKQGWLHKKGGGSSTLSRRNWKKRWFVLRQSKLMY 

10 1248 FENDSEEKLKGTVEVRTAKEIIDNTTKENGIDIIMADRTFHLIAESPEDA 

12 124 8 FENDSEEKLKGTVEVRTAKEIIDNTTKENGIDIIMADRTFHLIAESPEDA 

humX 12 49 FENDSEEKLKGTVEVRTAKEIIDNTTKENGIDIIMADRTFHLIAESPEDA 

10 12 98 SQWFSVLSQVHASTDQEIQEMHDEQANPQNAVGTLDVGLIDSVCASDSPD 

12 12 9 8 SQWFSVLSQVHASTDQEIQEMHDEQANPQNAVGTLDVGLIDSVCASDSPD 

humX 12 99 SQWFSVLSQVHASTDQEIQEMHDEQANPQNAVGTLDVGLIDSVCASDSPD 

10 1348 RPN S FVI I T AHRVLHCNADTPEEMHHWI TLLQRSKGDTRVEGQEF I VRGW 

12 1348 RPNSFVIITANRVLHCNADTPEEMHHWITLLQRSKGDTRVEGQEFIVRGW 

humX 1349 RPNSFVIITANRVLHCNADTPEEMHHWITLLQRSKGDTRVEGQEFIVRGW 

10 13 9 8 LHKEVKNSPKMSSLKLKKRWFVLTHNSLDYYKSSEKNALKLGTLVLNSLC 

12 13 98 LHKEVKNSPKMSSLKLKKRWFVLTHNSLDYYKSSEKNALKLGTLVLNSLC 

humX 13 99 LHKEVKNSPKMSSLKLKKRWFVLTHNSLDYYKSSEKMALKLGTLVLNSLC 

10 1448 S WP PDEKI FKETGYTCNVTVYGRKHC YRL YTKLLNEATRWS S VI QNVTDT 

12 1448 SWPPDEKIFKETGYWNVTVYGRKHCYRLYTKLLNEATRWSSVIQNVTDT 

humX 1449 SWPPDEKIFKETGYWNVTVYGRKHCYRLYTKLLNEATRWSSAIQNVTDT 

10 1498 KAPIDTPTQQLIQDIKENCLNSDWEQIYKRNPILRYTHHPLHSPLLPLP 

12 1498 KAPIDTPTQQLIQDIKENCLNSDWEQIYKRNPILRYTHHPLHSPLLPLP 

humX 149 9 KAPIDTPTQQLIQDIKENCLNSDWEQIYKRNPILRYTHHPLHSPLLPLP 

10 1548 YGDINLNLLKDKGYTTLQDEAIKIFNSLQQLESMSDPIPIIQGILQTGHD 

12 1548 YGDINLNLLKDKGYTTLQDEAIKIFNSLQQLESMSDPIPIIQGILQTGHD 

humX 1549 ygdinlnllkdkgyttlqdeaikifnslqqlesmsdpipiiqgilqtghd 

10 1598 LRPLRDELYCQLIKQTNKVPHPGSVGNLYSWQILTCLSCTFLPSRGILKY 

12 1598 LRPLRDELYCQLIKQTNKVPHPGSVGNLYSWQILTCLSCTFLPSRGILKY 

humX 15 99 LRPLRDELYCQLIKQTNKVPHPGSVGNLYSWQILTCLSCTFLPSRGILKY 

10 164 8 LKFHLKRIREQFPGTEMEKYALFTYESLKKTKCREFVPSRDEIEALIHRQ 

12 1648 LKFHLKRIREQFPGTEMEKYALFTYESLKKTKCREFVPSRDEIEALIHRQ 

humX 1649 LKFHLKRIREQFPGTEMEKYALFTYESLKKTKCREFVPSRDEIEALIHRQ 

10 1698 EMTSTVYCHGGGSCKITINSHTTAGEWEKLIRGLAMEDSRNMFALFEYN 

12 169 8 EMTSTVYCHGGGSCKITINSHTTAGEWEKLIRGLAMEDSRNMFALFEYN 

humX 1699 EMTSTVTCHGGGSCKITINSHTTAGEWEKLIRGLAMEDSRHMFALFEYN 

10 1748 GHVDKAIESRTWADVLAKFEKLAATSEVGDLPWKFYFKLYCFLDTDNVP 

12 1748 GHVDKAIESRTWADVLAKFEKLAATSEVGDLPWKFYFKLYCFLDTDNVP 

humX 1749 GHVDKAIESRTWADVLAKFEKLAATSEVGDLPWKFYFKLYCFLDTDNVP 

10 1798 KD SVEFAFMFE Q AHE AV I HGHH PAP EENLQVL AALRLQYLQGDYTLH AAI 

12 1798 KDSVEFAFMFEQAHEAVIHGHHPAPEE3MLQVLAALRLQYLQGDYTLHAAI 

humX 179 9 KDSVEFAFMFEQAHEAVIHGHHPAPEENLQVLAALRLQYLQGDYTLHAAI 

10 1848 PPLEEVYSLQRLKARISQSTKTFTPCERLEKRRTSFLEGTLRRSFRTGSV 
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12 

humX 


1848 
1849 


PPLEEVYSLQRLKARISQSTKTFTPCERLEKRRTSFLEGTLRRSFRTGSV 
PPLEEVYSLQRLKARISQSTKTFTPCERLEKRRTSFLEGTLRRSFRTGSV 


10 
12 

humX 


1898 
1898 
1899 


VRQKVEEEQMLDMWIKEEVSSARASIIDKWRKFQGMNQEQAMAKYMALIK 
VRQKVEEEQMLDMWI KEEVS SARAS I I DKWRKF QGMNQEQAMAKYMALI K 
VRQKVEEEQMLDMWIKEEVSSARASIIDKWRKFQGMNQEQAMAKYMALIK 


10 
12 

humX 


1948 
1948 
1949 


EWPGYGSTLFDVECKEGGFPQELWLGVSADAVSVYKRGEGRPLEVFQYEH 
F^PGYGSTLFDVEVRTG-CHVLGWAGCWHLRTWITAKFMWREDKMEHFAL 
EWPGYGSTLFDVECKEGGFPQELWLGVSADAVSVYKRGEGRPLEVFQYEH 


10 
12 

humX 


1998 
1997 
1999 


ILSFGAPLANTYKIWDERELLFETSEWDVAKLMKAYISMIVKKRYSTT 
STSFFRAPKIVPLTPPFSSQFLFSCWNASVILGMNAKLRCHLFFYPSLG 
ILSFGAPLANTYKIWDERELLFETSEWDVAKLMKAYISMIVKKRYSTT 


10 
12 

humX 


2048 
2047 
2049 


RSASSQGSSR 

KL 

RSASSQGSSR 



The invention also includes polypeptides and nucleotides having 80-100%, 
including 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98 and 99%, 
sequence identity to SEQ ID NOS.T-16, as well as nucleotides encoding any of these 

5 polypeptides, and compliments of any of these nucleotides. In an alternative 

embodiment, polypeptides and/or nucleotides (and compliments thereof) identical to any 
one of, or more than one of, SEQ ID NOS:l-16 are excluded. In yet another embodiment, 
polypeptides and/or nucleotides (and compliments thereof) having 81-100% identical, 
including 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98 and 99%, 

0 sequence identity to SEQ ID NOS:l-16 are excluded. 

The nucleic acids and proteins of the invention are potentially useful in promoting 
wound healing, for example after organ transplantation, or in the treatment of myocardial 
infarction, but also in treating tumors, and in cancers, diabetic retinopathy, macular 
degeneration, psoriasis, and rheumatoid arthritis. For example, a cDNA encoding AAP 

5 may be useful in gene therapy, and AAP proteins may be useful when administered to a 

subject in need thereof. The novel nucleic acid encoding AAP, and the AAP proteins of 
the invention, or fragments thereof, may further be useful in diagnostic applications, 
wherein the presence or amount of the nucleic acid or the protein are to be assessed. 
These materials are further useful in the generation of Abs that bind immunospecifically 
) to the novel substances of the invention for use in therapeutic or diagnostic methods. 

Kelch-like protein (KLP) 
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The putative protein encoded by KLP contains 1 putative BTB domain and 4 
putative Kelch motifs. The BTB (broad complex, tramtrack, bric-a-brac )/POZ (poxvirus, 
zinc finger) domain is involved in protein protein interactions. The kelch motif is sixfold 
tandem element in the sequence of the Drosophila kelch ORF1 protein that also contains 
BTB. Kelch ORF1 localizes to the ring canals in the egg chamber and helps to organize 
the F-actin cytoskeleton (Adams et al, 2000). The repeated kelch motifs predict a 
conserved tertiary structure, a (3-propeller. This module appears in many different 
polypeptide contexts and contains multiple potential protein-protein interaction sites. 
Members of this growing superfamily are present throughout the cell and extracellularly 
and have diverse activities (Adams et al. , 2000). Such activities include cytoskeleton 
organization, as well as other morphological processes, gene expression, interactions with 
viruses, and various extracellular events, such as cell spreading. 

Alignment with Drosophila kelch and other kelch-like proteins, human kelch-like 
protein (GenBank AAF20938 (SEQ ID NO: 17)), hypothetical C. elegans (GenBank 
061795 (SEQ ID NO: 18) and the skeletal muscle-specific sarcosin (GenBank 060662 
(SEQ ID NO: 19); (Taylor et al, 1998)) reveals that the disclosed protein (SEQ ID NO:2) 
is a member of a new subfamily 

KLP is associated with tube formation and angiogenesis because it is upregulated 
in the in vitro model of angiogenesis of Example 1. Kelch mediates cytoskeletal 
associations, it is involved in morphogenetic processes, such as tube formation, that 
depend on cytoskeletal arrangements and signaling. KLP represents an attractive target 
for small molecule drug therapy. 

Human ortholog of mouse BAZF (hBAZF) 

hBAZF (SEQ ID NO:4) is the human ortholog of mouse BAZF (GenBank 
AB01 1665; SEQ ID NO:20), BAZF is a Bcl-6 (LAZ3) homolog, a transcription repressor 
that controls germinal center formation and the T cell -dependent immune response. 
Expression of Bcl-6 negatively correlates with cellular proliferation: Bcl-6 suppresses 
growth associated with impaired mitotic S phase progression and apoptosis (Albagli et 
al, 1999). 

BAZF contains a BTB/POZ domain and five repeats of the Kruppel-like zinc 
finger motifs, instead of 6 in Bcl-6 (Okabe et al, 1998). Expression of BAZF mRNA is 
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relegated to heart and lung, unlike Bcl-6 mRNA, but is induced in activated lymphocytes 
as an immediate-early gene, like Bcl-6 (Okabe et ah, 1998). 

The hBAZF sequence was derived by using tblastn (protein query -translated 
database) (Altschul et al., 1997), with the mouse protein sequence (GenBank 088282; 
SEQ ID NO:21) that has homology to GenBank AC015918 (SEQ ID NO:22), a clone of 
Homo sapiens chromosome 17. Human BAZF contains five Kruppel-like zinc finger 
motif repeats and a BTB/POZ domain. 

The peptide sequence, "RSQ. ..PQV" that is present in the human sequence, might 
represent an alternative spliced form of the gene. Aligment with mouse BAZF, and 
aligment with mouse and human Bcl-6 demonstrates that the four proteins are almost 
identical in this region, but only human BAZF has this inserted sequence. 

hBAZF is upregulated in HUVE cells grown embedded in collagen gels but not as 
a monolayer grown on collagen. When HUVE cells are suspended in collagen, they do 
not proliferate. Analagous to the role of mBAZF plays a role in regulating cell 
proliferation (Okabe et al, 1998), hBAZF plays a roll in cell proliferation in HUVE 
suspended in collagen. Because of its high expression during vessel morphogenesis, 
hBAZF represents an excellent molecular marker, as well as an attractive target for 
various therapies to inhibit angiogenesis. 

hmt-Elongation Factor G (hEF-G) 

The original isolation of hEF-G (SEQ ID NO:6) is 84% identical and colinear 
with Rattus norvegicus nuclear encoded mitochondrial elongation factor G (GenBank 
L14684 (SEQ ID NO:23); (Barker et al, 1993). No human gene is described in 
GenBank. However, searching EST databases, the human gene is contained inside 
GenBank AC010936 (SEQ ED NO:24), a chromosome 3 clone. Aligment of hEF-G with 
rat mtEF-G and yeast EF-G1 demonstrates that the novel sequence is the ortholog of rat 
nuclear-encoded mitochondrial elongation factor G. 

Bacterial elongation factor G (EF-G) physically associates with translocation- 
competent ribosomes and facilitates transition to the subsequent codon through the 
coordinate binding and hydrolysis of GTP. The deduced amino acid sequence of hmt-EF- 
G reveals characteristic motifs shared by all GTP binding proteins. Therefore, similarly 
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to other elongation factors, the enzymatic function of hmt-EF-G is predicted to depend on 
GTP binding and hydrolysis. 

Hmt-EF-G is strongly induced (30-fold) in an in vitro model of angiogenesis 
(Example 1), and as such, hmt-EF-G represents an excellent molecular marker for vessel 
formation. Because of its putative localization to the mitochondrion, hmt-EF-G is also an 
attractive therapeutic target to treat disease states associated with mitochondrial 
dysfunction. 

Human thyroid regulated transcript (hTRG) 

hTRG (SEQ ID NO:8) is the human ortholog of rat TRG, a novel thyroid 
transcript negatively regulated by TSH (GenBank KIAA1058 (SEQ ID NO:25); 
(Bonapace et al, 1990). 

SEQ ID NO:25 appears to be a partial peptide since there are C. elegans 
homologous proteins of 2000 residues. Using tblastn (Altschul et al, 1997) against 
genomic sequences, the hTRG sequence (SEQ ID NO: 8) was assembled. 

In C. elegans, homologous proteins localize either to the plasma membrane or to 
the mitochondrial inner membrane. A partial sequence, KIAA0694 (SEQ ID NO:26) also 
localizes to the mitochondrial matrix. hTRG has a PH domain, and has weak homology 
to an extracellular fibronectin-binding protein precursor. SEQ ID NO:26 has homology 
to Drosophila DOS and mouse Gab-2 proteins; both of which are involved in signal 
transduction, acting as adapter proteins between receptors and kinases like Rasl (Hibi and 
Hirano, 2000). 

Because of hTRG is upregulated during the in vitro model of angiogenesis 
(Example 1), and because of its homologies with adapter proteins, hTRG is likely to be 
involved in signal transduction between receptors and kinases. As such, hTRG represent 
an excellent candidate for small molecule drug therapy to modulate angiogenesis and treat 
angiogenesis-related diseases. In addition, because of its putative ability to respond to 
thyroid stimulating hormone (TSH), modulation of hTRG is useful to treat diseases 
related to TSH imbalance. 



Human myosin X (hMXl(SEQ ID NO: 10) and hMX2 (SEQ ID NO: 12) 
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The hMX proteins represent the human ortholog of bovine myosin X, (GenBank 
AAB39486; SEQ ID NO:27). Using tblastn (Altschul et al., 1997) and the bovine 
sequence, a series of genomic clones from human chromosome 5 were identified; 
GenBank AC010310 (SEQ ID NO:28) appears to contain the entire sequence. 
Interestingly, a partial cDNA sequence from mouse (GenBank AF 184153; SEQ ID 
NO:29) localizes to a 0.8 cM interval on the short arm of chromosome 5, between the 
polymorphic microsatellite markers D5S416 and D5S21 14. In this region lies the gene 
for familial chondrocalcinosis (CCAL2) (Rojas et al, 1999). 

Another GenBank entry, AB018342 (SEQ ID NO:30) that represents the 3' region 
of hMX, appears to encode an alternative splice form. Noteworthy, this variant (hMX2) 
has a very hydrophobic carboxy terminus, while the more prevalent form (hMXl) is 
hydrophilic and potentially interacts with DNA/RNA since it has homology to high 
mobility group box (HMG) and ribosomal proteins. Additionaly, a myosin head domain 
was found in the NH terminus, as well as a myosin talin domain, two calmodulin binding 
domains, four pleckstrin domains and a band 4.1 domain. 

The band 4.1 domain represents a crossroads between cytoskeletal organization 
and signal transduction. The domain was first described in the red blood cell protein band 
4.1. The ERM proteins ezrin, radixin, and moesin and the unconventional myosins Vila 
and X all possess the band 4.1 domain (Louvet-Vallee, 2000). The band 4.1 domain 
binds single transmembrane protein at the membrane-proximal region in the C-terminal 
cytoplasmic tail. 

HMX is upregulated during angiogenesis in an in vitro model (Example 1). 
Because hMX contains the protein-protein interaction domains PH and band 4.1 domain, 
hMXl and hMX2 are involved in angiogenesis, likely transducing signals from 
angiogenic factors, perhaps modulating the cytoskeleton. 

Human mitochondrial protein (hMP) 

Analysis of hMP (SEQ ID NO: 14) reveals several subdomain that are homologous 
to proteins involved in transport across membranes, K + ATPase a and y chains. Further 
analysis indicates that hMP may bind DNA and or RNA, since hMP is homologous to 
histones and transcription factors, especially those possessing basic region plus leucine 
zipper domains. 
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Although PSORT analysis (Nakai and Horton, 1999) predicts nuclear localization 
(P=.6), hMP may in fact be a nuclear-encoded mitochondrial protein. Homologies with 
mostly bacterial proteins and a PSORT prediction ofmitochondrial matrix space 
localization (P=0.4478) strongly support this contention. 

Because hMP is upregulated in an in vitro model of angiogenesis (Example 1), 
and because of its homologies with mitochondrial and nuclear-localized polypeptides, 
hMP is important in vascular morphogenesis, most likely through either powering the 
cellular differentiation-redifferentiation process, and/or affecting changes in the nuclear 
matrix that change global gene expression. Alternatively, hMP may be a transcription 
factor for either the nuclear or mitochondrial genomes. 

Nuclear hormone receptor (NHR) 

NHR (SEQ ID NO:16) has two domains: (1) the NH region is similar to Swi3 
(yeast SWI/SNF complexes regulate transcription by chromatin remodeling), indicating a 
role in transcriptional regulation, and (2) the COOH region is similar to parathyroid 
hormone-related proteins that bind parathyroid hormones. PSORT (Nakai and Horton, 
1999) predicts the protein to localize in the nucleus P=0.9600. 

The identification of this new putative hormone receptor-transcriptional regulator 
and hBAZF suggest a novel human transcriptional pathway that resembles, to some 
extent, that of Bcl-6. 

Bcl-6 suppresses transcription via the BTB domain, which recruits a complex 
containing SMRT, retinoid thyroid hormone receptor, nuclear receptor corepressor (N- 
CoR), mammalian Sin3A, and histone deacetylase (HDAC). hBAZF, which also 
possesses a BTB domain, might recruit a similar complex containing deacetylase. 
Expression data indicate that hBAZF is up-regulated while NHR is down-regulated. 
These data agree with other evidence related to tube formation. Testosterone (a steroid) 
and dexamethasone (a steroid-like molecule) strongly inhibit vessel formation, and all- 
trans retinoic acid (at-RA) and 9-cis retinoic acid (9-cis RA) stimulate capillary-like 
tubular structures (Lansink et al, 1998). 

Upon angiogenic stimulation, endothelial cells may become incompetent to 
respond to anti-angiogenic responses mediated by hormones using a dual mechanism, 
sequestering hormones and suppressing transcription. Because nHR is down-regulated 
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during in vitro angiogenesis (Example 1), this polypeptide is likely to be involved in non- 
angiogenesis-specific gene transcription. nHR is an attractive therapeutic target, 
especially in therapies that are directed at preventing vascularization. 

AAP polynucleotides 

One aspect of the invention pertains to isolated nucleic acid molecules that encode 
AAP or biologically- active portions thereof. Also included in the invention are nucleic 
acid fragments sufficient for use as hybridization probes to identify AAP-encoding 
nucleic acids (e.g. , AAP mRNAs) and fragments for use as polymerase chain reaction 
(PCR) primers for the amplification and/or mutation of AAP molecules. A "nucleic acid 
molecule" includes DNA molecules (e.g., cDNA or genomic DNA), RNA molecules 
(e.g., mRNA), analogs of the DNA or RNA generated using nucleotide analogs, and 
derivatives, fragments and homologs. The nucleic acid molecule may be single- stranded 
or double-stranded, but preferably comprises double-stranded DNA. 

1. probes 

Probes are nucleic acid sequences of variable length, preferably between at least 
about 10 nucleotides (nt), 100 nt, or many (e.g., 6,000 nt) depending on the specific use. 
Probes are used to detect identical, similar, or complementary nucleic acid sequences. 
Longer length probes can be obtained from a natural or recombinant source, are highly 
specific, and much slower to hybridize than shorter-length oligomer probes. Probes may 
be single- or double-stranded and designed to have specificity in PCR, membrane-based 
hybridization technologies, or ELISA-like technologies. Probes are substantially purified 
oligonucleotides that will hybridize under stringent conditions to at least optimally 12, 25, 
50, 100, 150, 200, 250, 300, 350 or 400 consecutive sense strand nucleotide sequence of 
SEQ ID NOS:l, 3, 5, 7, 9, 11, 13, or 15; or an anti-sense strand nucleotide sequence of 
these sequences; or of a naturally occurring mutant of these sequences. 

The full- or partial length native sequence AAP may be used to "pull out" similar 
(homologous) sequences (Ausubel et al, 1987; Sambrook, 1989), such as: (1) full-length 
or fragments of AAP cDNA from a cDNA library from any species (e.g. human, murine, 
feline, canine, bacterial, viral, retroviral, yeast), (2) from cells or tissues, (3) variants 
within a species, and (4) homologues and variants from other species. To find related 
sequences that may encode related genes, the probe may be designed to encode unique 
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sequences or degenerate sequences. Sequences may also be genomic sequences including 
promoters, enhancer elements and introns of native sequence AAP. 

For example, an AAP coding region in another species may be isolated using such 
probes. A probe of about 40 bases is designed, based on an AAP, and made. To detect 
hybridizations, probes are labeled using, for example, radionuclides such as 32 P or 35 S, or 
enzymatic labels such as alkaline phosphatase coupled to the probe via avidin-biotin 
systems. Labeled probes are used to detect nucleic acids having a complementary 
sequence to that of an AAP in libraries of cDNA, genomic DNA or mRNA of a desired 
species. 

Such probes can be used as a part of a diagnostic test kit for identifying cells or 
tissues which mis-express an AAP, such as by measuring a level of an AAP in a sample of 
cells from a subject e.g., detecting AAP mRNA levels or determining whether a genomic 
AAP has been mutated or deleted. 

2. isolated nucleic acid 

An isolated nucleic acid molecule is separated from other nucleic acid molecules 
that are present in the natural source of the nucleic acid. Preferably, an isolated nucleic 
acid is free of sequences that naturally flank the nucleic acid {i.e., sequences located at the 
5'- and 3'-termini of the nucleic acid) in the genomic DNA of the organism from which 
the nucleic acid is derived. For example, in various embodiments, isolated AAP 
molecules can contain less than about 5 kb, 4 kb, 3 kb, 2 kb, 1 kb, 0.5 kb or 0.1 kb of 
nucleotide sequences which naturally flank the nucleic acid molecule in genomic DNA of 
the cell/tissue from which the nucleic acid is derived (e.g., brain, heart, liver, spleen, etc.). 
Moreover, an isolated nucleic acid molecule, such as a cDNA molecule, can be 
substantially free of other cellular material or culture medium when produced by 
recombinant techniques, or of chemical precursors or other chemicals when chemically 
synthesized. 

A nucleic acid molecule of the invention, e.g., a nucleic acid molecule having the 
nucleotide sequence of SEQ ID NOS:l, 3, 5, 7, 9, 11, 13 or 15, or a complement of this 
aforementioned nucleotide sequence, can be isolated using standard molecular biology 
techniques and the provided sequence information. Using all or a portion of the nucleic 
acid sequence of SEQ ID NOS:l, 3, 5, 7, 9, 11, 13 or 15 as a hybridization probe, AAP 
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molecules can be isolated using standard hybridization and cloning techniques (Ausubel 
etal, 1987; Sambrook, 1989). 

PCR amplification techniques can be used to amplify AAP using cDNA, mRNA 
or alternatively, genomic DNA, as a template and appropriate oligonucleotide primers. 
Such nucleic acids can be cloned into an appropriate vector and characterized by DNA 
sequence analysis. Furthermore, oligonucleotides corresponding to AAP sequences can 
be prepared by standard synthetic techniques, e.g., an automated DNA synthesizer. 

3. oligonucleotide 

An oligonucleotide comprises a series of linked nucleotide residues, which 
oligonucleotide has a sufficient number of nucleotide bases to be used in a PCR reaction 
or other application. A short oligonucleotide sequence may be based on, or designed 
from, a genomic or cDNA sequence and is used to amplify, confirm, or reveal the 
presence of an identical, similar or complementary DNA or RNA in a particular cell or 
tissue. Oligonucleotides comprise portions of a nucleic acid sequence having about 10 nt, 
50 nt, or 100 nt in length, preferably about 15 nt to 30 nt in length. In one embodiment of 
the invention, an oligonucleotide comprising a nucleic acid molecule less than 100 nt in 
length would further comprise at least 6 contiguous nucleotides of SEQ ID NOS:l, 3, 5, 
7, 9, 1 1, 13 or 15, or a complement thereof. Oligonucleotides may be chemically 
synthesized and may also be used as probes. 

4. complementary nucleic acid sequences; binding 

In another embodiment, an isolated nucleic acid molecule of the invention 
comprises a nucleic acid molecule that is a complement of the nucleotide sequence shown 
in SEQ ID NOS: 1, 3, 5, 7, 9, 11, 13 or 15, or a portion of this nucleotide sequence {e.g., a 
fragment that can be used as a probe or primer or a fragment encoding a biologically- 
active portion of an AAP). A nucleic acid molecule that is complementary to the 
nucleotide sequence shown in SEQ ID NOS:l, 3, 5, 7, 9, 1 1, 13 or 15, is one that is 
sufficiently complementary to the nucleotide sequence shown in SEQ ID NOS:l, 3, 5, 7, 
9, 11, 13 or 15, that it can hydrogen bond with little or no mismatches to the nucleotide 
sequence shown in SEQ ID NOS:l, 3, 5, 7, 9, 11, 13 or 15, thereby forming a stable 
duplex. 

"Complementary" refers to Watson-Crick or Hoogsteen base pairing between 
nucleotides units of a nucleic acid molecule, and the term "binding" means the physical 
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or chemical interaction between two polypeptides or compounds or associated 
polypeptides or compounds or combinations thereof. Binding includes ionic, non-ionic, 
van der Waals, hydrophobic interactions, and the like. A physical interaction can be 
either direct or indirect. Indirect interactions may be through or due to the effects of 
another polypeptide or compound. Direct binding refers to interactions that do not take 
place through, or due to, the effect of another polypeptide or compound, but instead are 
without other substantial chemical intermediates. 

Nucleic acid fragments are at least 6 (contiguous) nucleic acids or at least 4 
(contiguous) amino acids, a length sufficient to allow for specific hybridization in the 
case of nucleic acids or for specific recognition of an epitope in the case of amino acids, 
respectively, and are at most some portion less than a full-length sequence. Fragments 
may be derived from any contiguous portion of a nucleic acid or amino acid sequence of 
choice. 

5. derivatives, and analogs 

Derivatives are nucleic acid sequences or amino acid sequences formed from the 
native compounds either directly or by modification or partial substitution. Analogs are 
nucleic acid sequences or amino acid sequences that have a structure similar to, but not 
identical to, the native compound but differ from it in respect to certain components or 
side chains. Analogs may be synthetic or from a different evolutionary origin and may 
have a similar or opposite metabolic activity compared to wild type. Homologs are 
nucleic acid sequences or amino acid sequences of a particular gene that are derived from 
different species. 

Derivatives and analogs may be full length or other than full length, if the 
derivative or analog contains a modified nucleic acid or amino acid, as described below. 
Derivatives or analogs of the nucleic acids or proteins of the invention include, but are not 
limited to, molecules comprising regions that are substantially homologous to the nucleic 
acids or proteins of the invention, in various embodiments, by at least about 70%, 80%, or 
95% identity (with a preferred identity of 80-95%) over a nucleic acid or amino acid 
sequence of identical size or when compared to an aligned sequence in which the 
alignment is done by a computer homology program known in the art, or whose encoding 
nucleic acid is capable of hybridizing to the complement of a sequence encoding the 
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aforementioned proteins under stringent, moderately stringent, or low stringent conditions 
(Ausubel etal, 1987). 

6. homology 

A "homologous nucleic acid sequence" or "homologous amino acid sequence," or 
variations thereof, refer to sequences characterized by homology at the nucleotide level or 
amino acid level as discussed above. Homologous nucleotide sequences encode those 
sequences coding for isoforms of AAP. Isoforms can be expressed in different tissues of 
the same organism as a result of, for example, alternative splicing of RNA. Alternatively, 
different genes can encode isoforms. In the invention, homologous nucleotide sequences 
include nucleotide sequences encoding for an AAP of species other than humans, 
including, but not limited to: vertebrates, and thus can include, e.g., frog, mouse, rat, 
rabbit, dog, cat cow, horse, and other organisms. Homologous nucleotide sequences also 
include, but are not limited to, naturally occurring allelic variations and mutations of the 
nucleotide sequences set forth herein. A homologous nucleotide sequence does not, 
however, include the exact nucleotide sequence encoding human AAP. Homologous 
nucleic acid sequences include those nucleic acid sequences that encode conservative 
amino acid substitutions (see below) in SEQ ID NOS:2, 4, 6, 8, 10, 12, 14 or 16, as well 
as a polypeptide possessing AAP biological activity. Various biological activities of the 
AAP are described below. 

7. open reading frames 

The open reading frame (ORF) of an AAP gene encodes an AAP. An ORF is a 
nucleotide sequence that has a start codon (ATG) and terminates with one of the three 
"stop" codons (TAA, TAG, or TGA). In this invention, however, an ORF may be any 
part of a coding sequence that may or may not comprise a start codon and a stop codon. 
To achieve a unique sequence, preferable AAP ORFs encode at least 50 amino acids. 

AAP polypeptides 

1 . mature 

An AAP can encode a mature AAP. A "mature" form of a polypeptide or protein 
disclosed in the present invention is the product of a naturally occurring polypeptide or 
precursor form or proprotein. The naturally occurring polypeptide, precursor or 
proprotein includes, by way of nonlimiting example, the full-length gene product, 
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encoded by the corresponding gene. Alternatively, it may be defined as the polypeptide, 
precursor or proprotein encoded by an open reading frame described herein. The product 
"mature" form arises, again by way of nonlimiting example, as a result of one or more 
naturally occurring processing steps as they may take place within the cell, or host cell, in 
which the gene product arises. Examples of such processing steps leading to a "mature" 
form of a polypeptide or protein include the cleavage of the N-terminal methionine 
residue encoded by the initiation codon of an open reading frame, or the proteolytic 
cleavage of a signal peptide or leader sequence. Thus a mature form arising from a 
precursor polypeptide or protein that has residues 1 to N, where residue 1 is the N- 
terminal methionine, would have residues 2 through N remaining after removal of the N- 
terminal methionine. Alternatively, a mature form arising from a precursor polypeptide 
or protein having residues 1 to N, in which an N-terminal signal sequence from residue 1 
to residue M is cleaved, would have the residues from residue M+l to residue N 
remaining. Further as used herein, a "mature" form of a polypeptide or protein may arise 
from a step of post-translational modification other than a proteolytic cleavage event. 
Such additional processes include, by way of non-limiting example, glycosylation, 
myristoylation or phosphorylation. In general, a mature polypeptide or protein may result 
from the operation of only one of these processes, or a combination of any of them. 
2. active 

An active AAP polypeptide or AAP polypeptide fragment retains a biological 
and/or an immunological activity similar, but not necessarily identical, to an activity of a 
naturally-occuring (wild-type) AAP polypeptide of the invention, including mature forms. 
A particular biological assay, with or without dose dependency, can be used to determine 
AAP activity. A nucleic acid fragment encoding a biologically-active portion of AAP can 
be prepared by isolating a portion of SEQ ID NOS:l, 3, 5, 7, 9, 11, 13 or 15 that encodes 
a polypeptide having an AAP biological activity (the biological activities of the AAP are 
described below), expressing the encoded portion of AAP {e.g., by recombinant 
expression in vitro) and assessing the activity of the encoded portion of AAP. 
Immunological activity refers to the ability to induce the production of an antibody 
against an antigenic epitope possessed by a native AAP; biological activity refers to a 
function, either inhibitory or stimulatory, caused by a native AAP that excludes 
immunological activity. 
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AAP nucleic acid variants and hybridization 

1. variant polynucleotides, genes and recombinant genes The invention 
further encompasses nucleic acid molecules that differ from the nucleotide sequences 
shown in SEQ ID NOS:l, 3, 5, 7, 9, 11, 13 or 15 due to degeneracy of the genetic code 
and thus encode the same AAP as that encoded by the nucleotide sequences shown in 
SEQ ID NO NOS:l, 3, 5, 7, 9, 1 1, 13 or 15. An isolated nucleic acid molecule of the 
invention has a nucleotide sequence encoding a protein having an amino acid sequence 
shown in SEQ ID NOS:2, 4, 6, 8, 10, 12, 14 or 16. 

In addition to the AAP sequences shown in SEQ ID NOS:l, 3, 5, 7, 9, 11, 13 or 
15, DNA sequence polymorphisms that change the amino acid sequences of the AAP may 
exist within a population. For example, allelic variation among individuals will exhibit 
genetic polymorphism in an AAP. The terms "gene" and "recombinant gene" refer to 
nucleic acid molecules comprising an open reading frame (ORF) encoding an AAP, 
preferably a vertebrate AAP. Such natural allelic variations can typically result in 1-5% 
variance in an AAP. Any and all such nucleotide variations and resulting amino acid 
polymorphisms in an AAP, which are the result of natural allelic variation and that do not 
alter the functional activity of an AAP are within the scope of the invention. 

Moreover, AAP from other species that have a nucleotide sequence that differs 
from the human sequence of SEQ ID NOS:l, 3, 5, 7, 9, 11, 13 or 15, are contemplated. 
Nucleic acid molecules corresponding to natural allelic variants and homologues of an 
AAP cDNAs of the invention can be isolated based on their homology to an AAP of SEQ 
ID NOS:l, 3, 5, 7, 9, 11, 13 or 15 using cDNA-derived probes to hybridize to 
homologous AAP sequences under stringent conditions. 

"AAP variant polynucleotide" or "AAP variant nucleic acid sequence" means a 
nucleic acid molecule which encodes an active AAP that (1) has at least about 80% 
nucleic acid sequence identity with a nucleotide acid sequence encoding a full-length 
native AAP, (2) a full-length native AAP lacking the signal peptide, (3) an extracellular 
domain of an AAP, with or without the signal peptide, or (4) any other fragment of a full- 
length AAP. Ordinarily, an AAP variant polynucleotide will have at least about 80% 
nucleic acid sequence identity, more preferably at least about 81%, 82%, 83%, 84%, 
85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% nucleic 
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acid sequence identity and yet more preferably at least about 99% nucleic acid sequence 
identity with the nucleic acid sequence encoding a full-length native AAP. An AAP 
variant polynucleotide may encode a full-length native AAP lacking the signal peptide, an 
extracellular domain of an AAP, with or without the signal sequence, or any other 
5 fragment of a full-length AAP. Variants do not encompass the native nucleotide 

sequence. 

Ordinarily, AAP variant polynucleotides are at least about 30 nucleotides in 
length, often at least about 60, 90, 120, 150, 180, 210, 240, 270, 300, 450, 600 
nucleotides in length, more often at least about 900 nucleotides in length, or more. 

0 "Percent (%) nucleic acid sequence identity" with respect to AAP-encoding 

nucleic acid sequences identified herein is defined as the percentage of nucleotides in a 
candidate sequence that are identical with the nucleotides in the AAP sequence of interest, 
after aligning the sequences and introducing gaps, if necessary, to achieve the maximum 
percent sequence identity. Alignment for purposes of determining % nucleic acid 

5 sequence identity can be achieved in various ways that are within the skill in the art, for 

instance, using publicly available computer software such as BLAST, BLAST-2, ALIGN 
or Megalign (DNASTAR) software. Those skilled in the art can determine appropriate 
parameters for measuring alignment, including any algorithms needed to achieve 
maximal alignment over the full length of the sequences being compared. 

0 When nucleotide sequences are aligned, the % nucleic acid sequence identity of a 

given nucleic acid sequence C to, with, or against a given nucleic acid sequence D (which 
can alternatively be phrased as a given nucleic acid sequence C that has or comprises a 
certain % nucleic acid sequence identity to, with, or against a given nucleic acid sequence 
D) can be calculated as follows: 

> %nucleic acid sequence identity = W/Z " 100 

where 

W is the number of nucleotides cored as identical matches by the sequence 
alignment program's or algorithm's alignment of C and D 
and 

Z is the total number of nucleotides in D. 
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When the length of nucleic acid sequence C is not equal to the length of nucleic 
acid sequence D, the % nucleic acid sequence identity of C to D will not equal the % 
nucleic acid sequence identity of D to C. 
2. Stringency 

Homologs (i.e., nucleic acids encoding an AAP derived from species other than 
human) or other related sequences (e.g., paralogs) can be obtained by low, moderate or 
high stringency hybridization with all or a portion of the particular human sequence as a 
probe using methods well known in the art for nucleic acid hybridization and cloning. 

The specificity of single stranded DNA to hybridize complementary fragments is 
determined by the "stringency" of the reaction conditions. Hybridization stringency 
increases as the propensity to form DNA duplexes decreases. In nucleic acid 
hybridization reactions, the stringency can be chosen to either favor specific 
hybridizations (high stringency), which can be used to identify, for example, full-length 
clones from a library. Less-specific hybridizations (low stringency) can be used to 
identify related, but not exact, DNA molecules (homologous, but not identical) or 
segments. 

DNA duplexes are stabilized by: (1) the number of complementary base pairs, (2) 
the type of base pairs, (3) salt concentration (ionic strength) of the reaction mixture, (4) 
the temperature of the reaction, and (5) the presence of certain organic solvents, such as 
formamide which decreases DNA duplex stability. In general, the longer the probe, the 
higher the temperature required for proper annealing. A common approach is to vary the 
temperature: higher relative temperatures result in more stringent reaction conditions. 
(Ausubel et al, 1987) provide an excellent explanation of stringency of hybridization 
reactions. 

To hybridize under "stringent conditions" describes hybridization protocols in 
which nucleotide sequences at least 60% homologous to each other remain hybridized. 
Generally, stringent conditions are selected to be about 5°C lower than the thermal 
melting point (Tm) for the specific sequence at a defined ionic strength and pH. The Tm 
is the temperature (under defined ionic strength, pH and nucleic acid concentration) at 
which 50% of the probes complementary to the target sequence hybridize to the target 
sequence at equilibrium. Since the target sequences are generally present at excess, at 
Tm, 50% of the probes are occupied at equilibrium. 
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(a) high stringency 

"Stringent hybridization conditions" conditions enable a probe, primer or 
oligonucleotide to hybridize only to its target sequence. Stringent conditions are 
sequence-dependent and will differ. Stringent conditions comprise: (1) low ionic strength 
and high temperature washes (e.g. 15 mM sodium chloride, 1.5 mM sodium citrate, 0.1 % 
sodium dodecyl sulfate at 50°C); (2) a denaturing agent during hybridization (e.g. 50% 
(v/v) formamide, 0.1% bovine serum albumin, 0.1% Ficoll, 0.1% polyvinylpyrrolidone, 
50mM sodium phosphate buffer (pH 6.5; 750 mM sodium chloride, 75 mM sodium 
citrate at 42°C); or (3) 50% formamide. Washes typically also comprise 5X SSC (0.75 M 
NaCl, 75 mM sodium citrate), 50 mM sodium phosphate (pH 6.8), 0.1% sodium 
pyrophosphate, 5 x Denhardt's solution, sonicated salmon sperm DNA (50 ug/ml), 0.1% 
SDS, and 10% dextran sulfate at 42°C, with washes at 42°C in 0.2 x SSC (sodium 
chloride/sodium citrate) and 50% formamide at 55°C, followed by a high-stringency wash 
consisting of 0.1 x SSC containing EDTA at 55°C. Preferably, the conditions are such 
that sequences at least about 65%, 70%, 75%, 85%, 90%, 95%, 98%, or 99% homologous 
to each other typically remain hybridized to each other. These conditions are presented as 
examples and are not meant to be limiting. 

(b) moderate stringency 

"Moderately stringent conditions" use washing solutions and hybridization 
conditions that are less stringent (Sambrook, 1989), such that a polynucleotide will 
hybridize to the entire, fragments, derivatives or analogs of SEQ ID NOS:l, 3, 5, 7, 9, 11, 
13 or 15. One example comprises hybridization in 6X SSC, 5X Denhardt's solution, 0.5% 
SDS and 100 mg/ml denatured salmon sperm DNA at 55°C, followed by one or more 
washes in IX SSC, 0.1% SDS at 37°C. The temperature, ionic strength, etc., can be 
adjusted to accommodate experimental factors such as probe length. Other moderate 
stringency conditions are described in (Ausubel et al, 1987; Kriegler, 1990). 

(c) low stringency 

"Low stringent conditions" use washing solutions and hybridization conditions 
that are less stringent than those for moderate stringency (Sambrook, 1989), such that a 
polynucleotide will hybridize to the entire, fragments, derivatives or analogs of SEQ ID 
NOS:l, 3, 5, 7, 9, 11, 13 or 15. A non-limiting example of low stringency hybridization 
conditions are hybridization in 35% formamide, 5X SSC, 50 mM Tris-HCl (pH 7.5), 5 
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mM EDTA, 0.02% PVP, 0.02% Ficoll, 0.2% BSA, 100 mg/ml denatured salmon sperm 
DNA, 10% (wt/vol) dextran sulfate at 40°C, followed by one or more washes in 2X SSC, 
25 mM Tris-HCl (pH 7.4), 5 mM EDTA, and 0.1% SDS at 50°C. Other conditions of low 
stringency, such as those for cross-species hybridizations are described in (Ausubel et ah, 
1987; Kriegler, 1990; Shilo and Weinberg, 1981). 
3. Conservative mutations 

In addition to naturally-occurring allelic variants of AAP, changes can be 
introduced by mutation into SEQ ID NO NOS:l, 3, 5, 7, 9, 11, 13 or 15 sequences that 
incur alterations in the amino acid sequences of the encoded AAP that do not alter the 
AAP function. For example, nucleotide substitutions leading to amino acid substitutions 
at "non-essential" amino acid residues can be made in the sequence of SEQ ID NOS:2, 4, 
6, 8, 10, 12, 14 or 16. A "non-essential" amino acid residue is a residue that can be 
altered from the wild-type sequences of the AAP without altering their biological activity, 
whereas an "essential" amino acid residue is required for such biological activity. For 
example, amino acid residues that are conserved among the AAP of the invention are 
predicted to be particularly non-amenable to alteration. Amino acids for which 
conservative substitutions can be made are well-known in the art. 

Useful conservative substitutions are shown in Table A, "Preferred substitutions." 
Conservative substitutions whereby an amino acid of one class is replaced with another 
amino acid of the same type fall within the scope of the subject invention so long as the 
substitution does not materially alter the biological activity of the compound. If such 
substitutions result in a change in biological activity, then more substantial changes, 
indicated in Table B as exemplary are introduced and the products screened for an AAP 
polypeptide's biological activity. 



Table A Preferred substitutions 



Original residue 


Exemplary substitutions 


Preferred substitutions 


Ala (A) 


Val, Leu, lie 


Val 


Arg (R) 


Lys, Gin, Asn 


Lys 


Asn (N) 


Gin, His, Lys, Arg 


Gin 


Asp (D) 


Glu 


Glu 


Cys (C) 


Ser 


Ser 


Gin (Q) 


Asn 


Asn 


Glu (E) 


Asp 


Asp 
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Gly (G) 


Pro, Ala 


Ala 


His (H) 


Asn, Gin, Lys, Arg 


Arg 


He (I) 


Leu, Val, Met, Ala, Phe, 
Norleucine 


Leu 


Leu (L) 


Norleucine, He, Val, Met, Ala, 
Phe 


He 


Lys (K) 


Arg, Gin, Asn 


Arg 


Met (M) 


Leu, Phe, He 


Leu 


Phe (F) 


Leu, Val, He, Ala, Tyr 


Leu 


Pro (P) 


Ala 


Ala 


Ser (S) 


Thr 


Thr 


Thr (T) 


Ser 


Ser 


Trp(W) 


Tyr, Phe 


Tyr 


Tyr(Y) 


Trp, Phe, Thr, Ser 


Phe 


Val (V) 


lie, Leu, Met, Phe, Ala, 
Norleucine 


Leu 



Non-conservative substitutions that effect (1) the structure of the polypeptide 
backbone, such as a p-sheet or a-helical conformation, (2) the charge or (3) 
hydrophobicity, or (4) the bulk of the side chain of the target site can modify an AAP 
polypeptide's function or immunological identity. Residues are divided into groups based 
on common side-chain properties as denoted in Table B. Non-conservative substitutions 
entail exchanging a member of one of these classes for another class. Substitutions may 
be introduced into conservative substitution sites or more preferably into non-conserved 
sites. 



Table B Amino acid classes 



Class 


Amino acids 


hydrophobic 


Norleucine, Met, Ala, Val, Leu, lie 


neutral hydrophilic 


Cys, Ser, Thr 


acidic 


Asp, Glu 


basic 


Asn, Gin, His, Lys, Arg 


disrupt chain conformation 


Gly, Pro 


aromatic 


Trp, Tyr, Phe 



The variant polypeptides can be made using methods known in the art such as 
oligonucleotide-mediated (site-directed) mutagenesis, alanine scanning, and PCR 
mutagenesis. Site-directed mutagenesis (Carter, 1986; Zoller and Smith, 1987), cassette 
mutagenesis, restriction selection mutagenesis (Wells et al, 1985) or other known 
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techniques can be performed on the cloned DNA to produce the AAP variant DNA 
(Ausubel etal, 1987; Sambrook, 1989). 

In one embodiment, the isolated nucleic acid molecule comprises a nucleotide 
sequence encoding a protein, wherein the protein comprises an amino acid sequence at 
least about 45%, preferably 60%, more preferably 70%, 80%, 90%, and most preferably 
about 95% homologous to SEQ ID NOS:l, 3, 5, 7, 9, 11, 13, or 15. 

A mutant AAP can be assayed for blocking angiogenesis in vitro. 

4. Anti-sense nucleic acids 

Using antisense and sense AAP oligonucleotides can prevent AAP polypeptide 
expression. These oligonucleotides bind to target nucleic acid sequences, forming 
duplexes that block transcription or translation of the target sequence by enhancing 
degradation of the duplexes, terminating prematurely transcription or translation, or by 
other means. 

Antisense or sense oligonucleotides are singe-stranded nucleic acids, either RNA 
or DNA, which can bind a target AAP mRNA (sense) or an AAP DNA (antisense) 
sequences. Anti-sense nucleic acids can be designed according to Watson and Crick or 
Hoogsteen base pairing rules. The anti-sense nucleic acid molecule can be 
complementary to the entire coding region of an AAP mRNA, but more preferably, to 
only a portion of the coding or noncoding region of an AAP mRNA. For example, the 
anti-sense oligonucleotide can be complementary to the region surrounding the translation 
start site of an AAP mRNA. Antisense or sense oligonucleotides may comprise a 
fragment of the AAP DNA coding region of at least about 14 nucleotides, preferably from 
about 14 to 30 nucleotides. In general, antisense RNA or DNA molecules can comprise 
at least 5, 10, 15, 20, 25, 30, 35, 40, 45, 50, 55, 60, 65, 70, 75, 80, 85, 90, 95, 100 bases in 
length or more. Among others, (Stein and Cohen, 1988; van der Krol et at, 1988a) 
describe methods to derive antisense or a sense oligonucleotides from a given cDNA 
sequence. 

Examples of modified nucleotides that can be used to generate the anti-sense 
nucleic acid include: 5-fluorouracil, 5-bromouracil, 5-chlorouracil, 5-iodouracil, 
hypoxanthine, xanthine, 4-acetylcytosine, 5-(carboxyhydroxylmethyl) uracil, 5- 
carboxymethylaminomethyl-2-thiouridine, 5-carboxymethylaminomethyluracil, 
dihydrouracil, beta-D-galactosylqueosine, inosine, N6-isopentenyladenine, 1- 
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methylguanine, 1-methylinosine, 2,2-dimethylguanine, 2-methyladenine, 2- 
methylguanine, 3-methylcytosine, 5-methylcytosine, N6-adenine, 7-methylguanine, 5- 
methylaminomethyluracil, 5-methoxyaminomethyl-2-thiouracil, beta-D- 
mannosylqueosine, 5'-methoxycarboxymethyluracil, 5-methoxyuracil, 2-methylthio-N6- 
isopentenyladenine, uracil-5-oxyacetic acid (v), wybutoxosine, pseudouracil, queosine, 2- 
thiocytosine, 5-methyl-2-thiouracil, 2-thiouracil, 4-thiouracil, 5-methyluracil, uracil-5- 
oxyacetic acid methylester, uracil-5-oxyacetic acid (v), 5 -methyl- 2-thiouracil, 3-(3- 
amino-3-N-2-carboxypropyl) uracil, (acp3)w, and 2,6-diaminopurine. Alternatively, the 
anti-sense nucleic acid can be produced biologically using an expression vector into 
which a nucleic acid has been sub-cloned in an anti-sense orientation such that the 
transcribed RNA will be complementary to a target nucleic acid of interest. 

To introduce antisense or sense oligonucleotides into target cells (cells containing 
the target nucleic acid sequence), any gene transfer method may be used. Examples of 
gene transfer methods include (1) biological, such as gene transfer vectors like Epstein- 
Barr virus or conjugating the exogenous DNA to a ligand-binding molecule, (2) physical, 
such as electroporation and injection, and (3) chemical, such as CaP0 4 precipitation and 
oligonucleotide-lipid complexes. 

An antisense or sense oligonucleotide is inserted into a suitable gene transfer 
retroviral vector. A cell containing the target nucleic acid sequence is contacted with the 
recombinant retroviral vector, either in vivo or ex vivo. Examples of suitable retroviral 
vectors include those derived from the murine retrovirus M-MuLV, N2 (a retrovirus 
derived from M-MuLV), or the double copy vectors designated DCT5A, DCT5B and 
DCT5C (WO 90/13641, 1990). To achieve sufficient nucleic acid molecule 
transcription, vector constructs in which the transcription of the anti-sense nucleic acid 
molecule is controlled by a strong pol II or pol HI promoter are preferred. 

To specify target cells in a mixed population of cells cell surface receptors that are 
specific to the target cells can be exploited. Antisense and sense oligonucleotides are 
conjugated to a ligand-binding molecule, as described in (WO 91/04753, 1991). Ligands 
are chosen for receptors that are specific to the target cells. Examples of suitable ligand- 
binding molecules include cell surface receptors, growth factors, cytokines, or other 
ligands that bind to cell surface receptors or molecules. Preferably, conjugation of the 
ligand-binding molecule does not substantially interfere with the ability of the receptors 
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or molecule to bind the ligand-binding molecule conjugate, or block entry of the sense or 
antisense oligonucleotide or its conjugated version into the cell. 

Liposomes efficiently transfer sense or an antisense oligonucleotide to cells (WO 
90/1 0448, 1990). The sense or antisense oligonucleotide-lipid complex is preferably 
dissociated within the cell by an endogenous lipase. 

The anti-sense nucleic acid molecule of the invention may be an a-anomeric 
nucleic acid molecule. An a-anomeric nucleic acid molecule forms specific double- 
stranded hybrids with complementary RNA in which, contrary to the usual a-units, the 
strands run parallel to each other (Gautier et al, 1987). The anti-sense nucleic acid 
molecule can also comprise a 2'-o-methylribonucleotide (Inoue et al, 1987a) or a 
chimeric RNA-DNA analogue (Inoue et al. , 1987b). 

In one embodiment, an anti-sense nucleic acid of the invention is a ribozyme. 
Ribozymes are catalytic RNA molecules with ribonuclease activity that are capable of 
cleaving a single-stranded nucleic acid, such as an mRNA, to which they have a 
complementary region. Thus, ribozymes, such as hammerhead ribozymes (Haseloff and 
Gerlach, 1988) can be used to catalytically cleave AAP mRNA transcripts and thus inhibit 
translation. A ribozyme specific for an AAP-encoding nucleic acid can be designed 
based on the nucleotide sequence of an AAP cDNA (i.e., SEQ ID NOS:l, 3, 5, 7, 9, 1 1, 13 
or 15). For example, a derivative of a Tetrahymena L-19 IVS RNA can be constructed in 
which the nucleotide sequence of the active site is complementary to the nucleotide 
sequence to be cleaved in an AAP-encoding mRNA (Cech et al, U.S. Patent No. 
5,116,742, 1992; Cech etal, U.S. Patent No. 4,987,071, 1991). An AAP mRNA can also 
be used to select a catalytic RNA having a specific ribonuclease activity from a pool of 
RNA molecules (Battel and Szostak, 1993). 

Alternatively, AAP expression can be inhibited by targeting nucleotide sequences 
complementary to the regulatory region of an AAP (e.g., an AAP promoter and/or 
enhancers) to form triple helical structures that prevent transcription of an AAP in target 
cells (Helene, 1991; Helene et al, 1992; Maher, 1992). 

Modifications of antisense and sense oligonucleotides can augment their 
effectiveness. Modified sugar-phosphodiester bonds or other sugar linkages (WO 
91/06629, 1991), increase in vivo stability by conferring resistance to endogenous 
nucleases without disrupting binding specificity to target sequences. Other modifications 
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can increase the affinities of the oligonucleotides for their targets, such as covalently 
linked organic moieties (WO 90/10448, 1990) or poly-(L)-lysine. Other attachments 
modify binding specificities of the oligonucleotides for their targets, including metal 
complexes or intercalating (e.g. ellipticine) and alkylating agents. 

5 For example, the deoxyribose phosphate backbone of the nucleic acids can be 

modified to generate peptide nucleic acids (Hyrup and Nielsen, 1996). "Peptide nucleic 
acids" or "PNAs" refer to nucleic acid mimics (e.g., DNA mimics) in that the deoxyribose 
phosphate backbone is replaced by a pseudopeptide backbone and only the four natural 
nucleobases are retained. The neutral backbone of PNAs allows for specific 

0 hybridization to DNA and RNA under conditions of low ionic strength. The synthesis of 

PNA oligomers can be performed using standard solid phase peptide synthesis protocols 
(Hyrup and Nielsen, 1996; Perry-O'Keefe et al, 1996). 

PNAs of an AAP can be used in therapeutic and diagnostic applications. For 
example, PNAs can be used as anti-sense or antigene agents for sequence-specific 

5 modulation of gene expression by inducing transcription or translation arrest or inhibiting 

replication. AAP PNAs may also be used in the analysis of single base pair mutations 
(e.g., PNA directed PCR clamping; as artificial restriction enzymes when used in 
combination with other enzymes, e.g., Si nucleases (Hyrup and Nielsen, 1996); or as 
probes or primers for DNA sequence and hybridization (Hyrup and Nielsen, 1996; Perry- 

0 O'Keefe etal, 1996). 

PNAs of an AAP can be modified to enhance their stability or cellular uptake. 
Lipophilic or other helper groups may be attached to PNAs, PNA-DNA dimmers formed, 
or the use of liposomes or other drug delivery techniques. For example, PNA-DNA 
chimeras can be generated that may combine the advantageous properties of PNA and 

5 DNA. Such chimeras allow DNA recognition enzymes (e.g., RNase H and DNA 

polymerases) to interact with the DNA portion while the PNA portion provides high 
binding affinity and specificity. PNA-DNA chimeras can be linked using linkers of 
appropriate lengths selected in terms of base stacking, number of bonds between the 
nucleobases, and orientation (Hyrup and Nielsen, 1996). The synthesis of PNA-DNA 
chimeras can be performed (Finn et al, 1996; Hyrup and Nielsen, 1996). For example, a 
DNA chain can be synthesized on a solid support using standard phosphoramidite 
coupling chemistry, and modified nucleoside analogs, e.g., 5'-(4-methoxytrityl)amino-5'- 
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deoxy-thymidine phosphoramidite, can be used between the PNA and the 5' end of DNA 
(Finn et al, 1996; Hyrup and Nielsen, 1996). PNA monomers are then coupled in a 
stepwise manner to produce a chimeric molecule with a 5' PNA segment and a 3' DNA 
segment (Finn et al, 1996). Alternatively, chimeric molecules can be synthesized with a 
5' DNA segment and a 3' PNA segment (Petersen et al, 1976). 

The oligonucleotide may include other appended groups such as peptides (e.g., for 
targeting host cell receptors in vivo), or agents facilitating transport across the cell 
membrane (Lemaitre et al, 1987; Letsinger et al, 1989) or PCT Publication No. 
WO88/09810) or the blood-brain barrier (e.g., PCT Publication No. WO 89/10134). In 
addition, oligonucleotides can be modified with hybridization-triggered cleavage agents 
(van der Krol et al, 1988b) or intercalating agents (Zon, 1988). The oligonucleotide may 
be conjugated to another molecule, e.g., a peptide, a hybridization triggered cross-linking 
agent, a transport agent, a hybridization-triggered cleavage agent, and the like. 

AAP polypeptides 

One aspect of the invention pertains to isolated AAP, and biologically-active 
portions derivatives, fragments, analogs or homologs thereof. Also provided are 
polypeptide fragments suitable for use as immunogens to raise anti-AAP Abs. In one 
embodiment, a native AAP can be isolated from cells or tissue sources by an appropriate 
purification scheme using standard protein purification techniques. In another 
embodiment, AAP are produced by recombinant DNA techniques. Alternative to 
recombinant expression, an AAP can be synthesized chemically using standard peptide 
synthesis techniques. 

1. Polypeptides 

An AAP polypeptide includes the amino acid sequence of an AAP whose 
sequences are provided in SEQ ID NOS:2, 4, 6, 8, 10, 12, 14 or 16. The invention also 
includes a mutant or variant protein any of whose residues may be changed from the 
corresponding residues shown in SEQ ID NOS:2, 4, 6, 8, 10, 12, 14 or 16, while still 
encoding a protein that maintains its AAP activities and physiological functions, or a 
functional fragment thereof. 

2. Variant AAP polypeptides 
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In genera], an AAP variant that preserves an AAP-like function and includes any 
variant in which residues at a particular position in the sequence have been substituted by 
other amino acids, and further includes the possibility of inserting an additional residue or 
residues between two residues of the parent protein as well as the possibility of deleting 
one or more residues from the parent sequence. Any amino acid substitution, insertion, or 
deletion is encompassed by the invention. In favorable circumstances, the substitution is 
a conservative substitution as defined above. 

"AAP polypeptide variant" means an active AAP polypeptide having at least: (1) 
about 80% amino acid sequence identity with a full-length native sequence AAP 
polypeptide sequence, (2) an AAP polypeptide sequence lacking the signal peptide, (3) an 
extracellular domain of an AAP polypeptide, with or without the signal peptide, or (4) 
any other fragment of a full-length AAP polypeptide sequence. For example, AAP 
polypeptide variants include AAP polypeptides wherein one or more amino acid residues 
are added or deleted at the N- or C- terminus of the full-length native amino acid 
sequence. An AAP polypeptide variant will have at least about 80% amino acid sequence 
identity, preferably at least about 81% amino acid sequence identity, more preferably at 
least about 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 
95%, 96%, 97%, 98% amino acid sequence identity and most preferably at least about 
99% amino acid sequence identity with a full-length native sequence AAP polypeptide 
sequence. An AAP polypeptide variant may have a sequence lacking the signal peptide, 
an extracellular domain of an AAP polypeptide, with or without the signal peptide, or any 
other fragment of a full-length AAP polypeptide sequence. Ordinarily, AAP variant 
polypeptides are at least about 10 amino acids in length, often at least about 20 amino 
acids in length, more often at least about 30, 40, 50, 60, 70, 80, 90, 100, 150, 200, or 300 
amino acids in length, or more. 

"Percent (%) amino acid sequence identity" is defined as the percentage of amino 
acid residues that are identical with amino acid residues in a disclosed AAP polypeptide 
sequence in a candidate sequence when the two sequences are aligned. To determine % 
amino acid identity, sequences are aligned and if necessary, gaps are introduced to 
achieve the maximum % sequence identity; conservative substitutions are not considered 
as part of the sequence identity. Amino acid sequence alignment procedures to determine 
percent identity are well known to those of skill in the art. Often publicly available 
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computer software such as BLAST, BLAST2, ALIGN2 or Megalign (DNASTAR) 
software is used to align peptide sequences. Those skilled in the art can determine 
appropriate parameters for measuring alignment, including any algorithms needed to 
achieve maximal alignment over the full length of the sequences being compared. 
5 When amino acid sequences are aligned, the % amino acid sequence identity of a 

given amino acid sequence A to, with, or against a given amino acid sequence B (which 
can alternatively be phrased as a given amino acid sequence A that has or comprises a 
certain % amino acid sequence identity to, with, or against a given amino acid sequence 
B) can be calculated as: 

0 

%amino acid sequence identity = X/Y " 100 
where 

X is the number of amino acid residues scored as identical matches by the 
5 sequence alignment program's or algorithm's alignment of A and B 

and 

Y is the total number of amino acid residues in B. 

If the length of amino acid sequence A is not equal to the length of amino acid 
sequence B, the % amino acid sequence identity of A to B will not equal the % amino 
0 acid sequence identity of B to A. 

3. Isolated/purified polypeptides 

An "isolated" or "purified" polypeptide, protein or biologically active fragment is 
separated and/or recovered from a component of its natural environment. Contaminant 
components include materials that would typically interfere with diagnostic or therapeutic 
3 uses for the polypeptide, and may include enzymes, hormones, and other proteinaceous or 

non-proteinaceous materials. Preferably, the polypeptide is purified to a sufficient degree 
to obtain at least 15 residues of N-terminal or internal amino acid sequence. To be 
substantially isolated, preparations having less than 30% by dry weight of non-AAP 
contaminating material (contaminants), more preferably less than 20%, 10% and most 
preferably less than 5% contaminants. An isolated, recombinantly-produced AAP or 
biologically active portion is preferably substantially free of culture medium, i.e., culture 
medium represents less than 20%, more preferably less than about 10%, and most 



-76- 

preferably less than about 5% of the volume of the AAP preparation. Examples of 
contaminants include cell debris, culture media, and substances used and produced during 
in vitro synthesis of an AAP. 

4. Biologically active 

5 Biologically active portions of an AAP include peptides comprising amino acid 

sequences sufficiently homologous to or derived from the amino acid sequences of an 
AAP (SEQ ID NOS:2, 4, 6, 8, 10, 12, 14 or 16) that include fewer amino acids than a 
full-length AAP, and exhibit at least one activity of an AAP. Biologically active portions 
comprise a domain or motif with at least one activity of a native AAP. A biologically 

0 active portion of an AAP can be a polypeptide that is, for example, 10, 25, 50, 100 or 

more amino acid residues in length. Other biologically active portions, in which other 
regions of the protein are deleted, can be prepared by recombinant techniques and 
evaluated for one or more of the functional activities of a native AAP. 

Biologically active portions of an AAP may have an amino acid sequence shown 

5 in SEQ ID NOS:2, 4, 6, 8, 10, 12, 14 or 16, or substantially homologous to SEQ ID 

NOS:2, 4, 6, 8, 10, 12, 14 or 16, and retains the functional activity of the protein of SEQ 
ID NOS:2, 4, 6, 8, 10, 12, 14 or 16, yet differs in amino acid sequence due to natural 
allelic variation or mutagenesis. Other biologically active AAP may comprise an amino 
acid sequence at least 45% homologous to the amino acid sequence of SEQ ID NOS:2, 4, 

0 6, 8, 10, 12, 14 or 16, and retains the functional activity of native AAP. 

5. Determining homology between two or more sequences 

"AAP variant" means an active AAP having at least: (1) about 80% amino acid 
sequence identity with a full-length native sequence AAP sequence, (2) an AAP sequence 
lacking the signal peptide, (3) an extracellular domain of an AAP, with or without the 

5 signal peptide, or (4) any other fragment of a full-length AAP sequence. For example, 

AAP variants include an AAP wherein one or more amino acid residues are added or 
deleted at the N- or C- terminus of the full-length native amino acid sequence. An AAP 
variant will have at least about 80% amino acid sequence identity, preferably at least 
about 81% amino acid sequence identity, more preferably at least about 82%, 83%, 84%, 

) 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% amino 

acid sequence identity and most preferably at least about 99% amino acid sequence 
identity with a full-length native sequence AAP sequence. An AAP variant may have a 
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sequence lacking the signal peptide, an extracellular domain of an AAP, with or without 
the signal peptide, or any other fragment of a full-length AAP sequence. Ordinarily, AAP 
variant polypeptides are at least about 10 amino acids in length, often at least about 20 
amino acids in length, more often at least about 30, 40, 50, 60, 70, 80, 90, 100, 150, 200, 
or 300 amino acids in length, or more. 

"Percent (%) amino acid sequence identity" is defined as the percentage of amino 
acid residues that are identical with amino acid residues in a disclosed AAP sequence in a 
candidate sequence when the two sequences are aligned. To determine % amino acid 
identity, sequences are aligned and if necessary, gaps are introduced to achieve the 
maximum % sequence identity; conservative substitutions are not considered as part of 
the sequence identity. Amino acid sequence alignment procedures to determine percent 
identity are well known to those of skill in the art. Often publicly available computer 
software such as BLAST, BLAST2, ALIGN2 or Megalign (DNASTAR) software is used 
to align peptide sequences. Those skilled in the art can determine appropriate parameters 
for measuring alignment, including any algorithms needed to achieve maximal alignment 
over the full length of the sequences being compared. 

When amino acid sequences are aligned, the % amino acid sequence identity of a 
given amino acid sequence A to, with, or against a given amino acid sequence B (which 
can alternatively be phrased as a given amino acid sequence A that has or comprises a 
certain % amino acid sequence identity to, with, or against a given amino acid sequence 
B) can be calculated as: 

% amino acid sequence identity = X/Y ' 100 
where 

X is the number of amino acid residues scored as identical matches by the 
sequence alignment program's or algorithm's alignment of A and B 
and 

Y is the total number of amino acid residues in B. 

If the length of amino acid sequence A is not equal to the length of amino acid 
sequence B, the % amino acid sequence identity of A to B will not equal the % amino 
acid sequence identity of B to A. 
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6. Chimeric and fusion proteins 

Fusion polypeptides are useful in expression studies, cell-localization, bioassays, 
and AAP purification. An AAP "chimeric protein" or "fusion protein" comprises an AAP 

5 fused to a non-AAP polypeptide. A non-AAP polypeptide is not substantially 

homologous to an AAP (SEQ ID NOS:2, 4, 6, 8, 10, 12, 14 or 16). An AAP fusion 
protein may include any portion to an entire AAP, including any number of the 
biologically active portions. An AAP may be fused to the C-terminus of the GST 
(glutathione S -transferase) sequences. Such fusion proteins facilitate the purification of a 

0 recombinant AAP. In certain host cells, (e.g. mammalian), heterologous signal 

sequences fusions may ameliorate AAP expression and/or secretion. Additional 
exemplary fusions are presented in Table C. 

Other fusion partners can adapt an AAP therapeutically. Fusions with members of 
the immunoglobulin (Ig) protein family are useful in therapies that inhibit an AAP Iigand 

5 or substrate interactions, consequently suppressing an AAP-mediated signal transduction 

in vivo. Such fusions, incorporated into pharmaceutical compositions, may be used to 
treat proliferative and differentiation disorders, as well as modulating cell survival. An 
AAP-Ig fusion polypeptides can also be used as immunogens to produce an anti-AAP 
Abs in a subject, to purify AAP ligands, and to screen for molecules that inhibit 

D interactions of an AAP with other molecules. 

Fusion proteins can be easily created using recombinant methods. A nucleic acid 
encoding an AAP can be fused in-frame with a non-AAP encoding nucleic acid, to an 
AAP NH 2 - or COO- -terminus, or internally. Fusion genes may also be synthesized by 
conventional techniques, including automated DNA synthesizers. PCR amplification 

> using anchor primers that give rise to complementary overhangs between two consecutive 

gene fragments that can subsequently be annealed and reamplified to generate a chimeric 
gene sequence (Ausubel et al, 1987) is also useful. Many vectors are commercially 
available that facilitate sub-cloning an AAP in-frame to a fusion moiety. 



Table C Useful non-AAP fusion polypeptides 



Reporter [ in vitro \ in vivo | Notes | Reference" 



-79- 



Human growth 
hormone (hGH) 


Radioimmuno- 
assay 


none 


Expensive, 
insensitive, 
narrow linear 
range. 


(Selden et al, 
1986) 


p-glucu- 
ronidase (GUS) 


Colori metric, 
fluorescent, or 
chemi- 
luminescent 


colorimetric 
(histo-chemical 
staining with X- 
gluc) 


sensitive, 
broad linear 
range, non- 
iostopic. 


(Gallagher, 
1992) 


Green 
fluorescent 
protein (GFP) 
and related 
molecules (RFP, 
BFP, AAP, etc.) 


Fluorescent 


fluorescent 


can be used in 
live cells; 
resists photo- 
bleaching 


(Chalfie et al, 
1994) 


Luciferase 
(firefly) 


bioluminsecent 


Bio- 
luminescent 


protein is 
unstable, 
difficult to 
reproduce, 
signal is brief 


(de Wet et al, 
1987) 


Chloramphenico 
al 

acetyltransferas 
e (CAT) 


Chromato- 
graphy, 
differential 
extraction, 
fluorescent, or 
immunoassay 


none 


Expensive 
radioactive 
substrates, 
time- 
consuming, 
insensitive, 
narrow linear 
range 


(Gorman et al, 
1982) 


P-galacto-sidase 


colorimetric, 
fluorescence, 
chemi- 
luminscence 


colorimetric 
(histochemical 
staining with X- 
gal), bio- 
luminescent in 
live cells 


sensitive, 
broad linear 
range; some 
cells have high 
endogenous 
activity 


(Alam and 
Cook, 1990) 


Secrete alkaline 

phosphatase 

(SEAP) 


colorimetric, 
bioluminescent, 
chemi- 
luminescent 


none 


Chem- 
iluminscence 
assay is 
sensitive and 
broad linear 
range; some 
cells have 
endogenouse 
alkaline 
phosphatase 
activity 


(Berger et al, 
1988) 



Therapeutic applications of AAP 
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1. Agonists and antagonists 

"Antagonist" includes any molecule that partially or fully blocks, inhibits, or 
neutralizes a biological activity of an endogenous AAP. Similarly, "agonist" includes any 
molecule that mimics a biological activity of an endogenous AAP. Molecules that can act 
as agonists or antagonists include Abs or antibody fragments, fragments or variants of an 
endogenous AAP, peptides, antisense oligonucleotides, small organic molecules, etc. 

2. Identifying antagonists and agonists 

To assay for antagonists, an AAP is added to, or expressed in, a cell along with 
the compound to be screened for a particular activity. If the compound inhibits the 
activity of interest in the presence of an AAP, that compound is an antagonist to the AAP; 
if an AAP activity is enhanced, the compound is an agonist. 

(a) Specific examples of potential antagonists and agonist 

Any molecule that alters AAP cellular effects is a candidate antagonist or agonist. 
Screening techniques well known to those skilled in the art can identify these molecules. 
Examples of antagonists and agonists include: (1) small organic and inorganic 
compounds, (2) small peptides, (3) Abs and derivatives, (4) polypeptides closely related 
to an AAP, (5) antisense DNA and RNA, (6) ribozymes, (7) triple DNA helices and (8) 
nucleic acid aptamers. 

Small molecules that bind to an AAP active site or other relevant part of the 
polypeptide and inhibit the biological activity of the AAP are antagonists. Examples of 
small molecule antagonists include small peptides, peptide-like molecules, preferably 
soluble, and synthetic non-peptidyl organic or inorganic compounds. These same 
molecules, if they enhance an AAP activity, are examples of agonists. 

Almost any antibody that affects an AAP's function is a candidate antagonist, and 
occasionally, agonist. Examples of antibody antagonists include polyclonal, monoclonal, 
single-chain, anti-idiotypic, chimeric Abs, or humanized versions of such Abs or 
fragments. Abs may be from any species in which an immune response can be raised. 
Humanized Abs are also contemplated. 

Alternatively, a potential antagonist or agonist may be a closely related protein, 
for example, a mutated form of an AAP that recognizes an AAP-interacting protein but 
imparts no effect, thereby competitively inhibiting AAP action. Alternatively, a mutated 
AAP may be constitutively activated and may act as an agonist. 
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Antisense RNA or DNA constructs can be effective antagonists. Antisense RNA 
or DNA molecules block function by inhibiting translation by hybridizing to targeted 
mRNA. Antisense technology can be used to control gene expression through triple-helix 
formation or antisense DNA or RNA, both of which depend on polynucleotide binding to 
DNA or RNA. For example, the 5' coding portion of an AAP sequence is used to design 
an antisense RNA oligonucleotide of from about 10 to 40 base pairs in length. A DNA 
oligonucleotide is designed to be complementary to a region of the gene involved in 
transcription (triple helix) (Beal and Dervan, 1991; Cooney et al, 1988; Lee et al, 1979), 
thereby preventing transcription and the production of the AAP. The antisense RNA 
oligonucleotide hybridizes to the mRNA in vivo and blocks translation of the mRNA 
molecule into the AAP (antisense) (Cohen, 1989; Okano et al, 1991). These 
oligonucleotides can also be delivered to cells such that the antisense RNA or DNA may 
be expressed in vivo to inhibit production of the AAP. When antisense DNA is used, 
oligodeoxyribonucleotides derived from the translation-initiation site, e.g., between about 
-10 and +10 positions of the target gene nucleotide sequence, are preferred. 

Ribozymes are enzymatic RNA molecules capable of catalyzing the specific 
cleavage of RNA. Ribozymes act by sequence-specific hybridization to the 
complementary target RNA, followed by endonucleolytic cleavage. Specific ribozyme 
cleavage sites within a potential RNA target can be identified by known techniques (WO 
97/33551, 1997; Rossi, 1994). 

To inhibit transcription, triple-helix nucleic acids that are single-stranded and 
comprise deoxynucleotides are useful antagonists. These oligonucleotides are designed 
such that triple-helix formation via Hoogsteen base-pairing rules is promoted, generally 
requiring stretches of purines or pyrimidines (WO 97/33551, 1997). 

Aptamers are short oligonucleotide sequences that can be used to recognize and 
specifically bind almost any molecule. The systematic evolution of ligands by 
exponential enrichment (SELEX) process (Ausubel et al, 1987; Ellington and Szostak, 
1990; Tuerk and Gold, 1990) is powerful and can be used to find such aptamers. 
Aptamers have many diagnostic and clinical uses; almost any use in which an antibody 
has been used clinically or diagnostically, aptamers too may be used. In addition, they 
are cheaper to make once they have been identified, and can be easily applied in a variety 
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of formats, including administration in pharmaceutical compositions, in bioassays, and 
diagnostic tests (Jayasena, 1999). 

Anti-AAPAbs 

The invention encompasses Abs and antibody fragments, such as F ab or (F ab )2, that 
bind immunospecifically to any AAP epitopes. 

"Antibody" (Ab) comprises single Abs directed against an AAP (anti-AAP Ab; 
including agonist, antagonist, and neutralizing Abs), anti-AAP Ab compositions with 
poly-epitope specificity, single chain anti-AAP Abs, and fragments of anti-AAP Abs. A 
"monoclonal antibody" is obtained from a population of substantially homogeneous Abs, 
i.e., the individual Abs comprising the population are identical except for possible 
naturally-occurring mutations that may be present in minor amounts. Exemplary Abs 
include polyclonal (pAb), monoclonal (mAb), humanized, bi-specific (bsAb), and 
heteroconjugate Abs. 

1 . Polyclonal Abs (pAbs) 

Polyclonal Abs can be raised in a mammalian host, for example, by one or more 
injections of an immunogen and, if desired, an adjuvant. Typically, the immunogen 
and/or adjuvant are injected in the mammal by multiple subcutaneous or intraperitoneal 
injections. The immunogen may include an AAP or a fusion protein. Examples of 
adjuvants include Freund's complete and monophosphoryl Lipid A synthetic-trehalose 
dicorynomycolate (MPL-TDM). To improve the immune response, an immunogen may 
be conjugated to a protein that is immunogenic in the host, such as keyhole limpet 
hemocyanin (KLH), serum albumin, bovine thyroglobulin, and soybean trypsin inhibitor. 
Protocols for antibody production are described by (Ausubel et al., 1987; Harlow and 
Lane, 1988). Alternatively, pAbs may be made in chickens, producing IgY molecules 
(Schadeef a/., 1996). 

2. Monoclonal Abs ( mAbs ) 

Anti-AAP mAbs may be prepared using hybridoma methods (Milstein and Cuello, 
1983). Hybridoma methods comprise at least four steps: (1) immunizing a host, or 
lymphocytes from a host; (2) harvesting the mAb secreting (or potentially secreting) 
lymphocytes, (3) fusing the lymphocytes to immortalized cells, and (4) selecting those 
cells that secrete the desired (anti-AAP) mAb. 
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A mouse, rat, guinea pig, hamster, or other appropriate host is immunized to elicit 
lymphocytes that produce or are capable of producing Abs that will specifically bind to 
the immunogen. Alternatively, the lymphocytes may be immunized in vitro. If human 
cells are desired, peripheral blood lymphocytes (PBLs) are generally used; however, 
spleen cells or lymphocytes from other mammalian sources are preferred. The 
immunogen typically includes an AAP or a fusion protein. 

The lymphocytes are then fused with an immortalized cell line to form hybridoma 
cells, facilitated by a fusing agent such as polyethylene glycol (Goding, 1996). Rodent, 
bovine, or human myeloma cells immortalized by transformation may be used, or rat or 
mouse myeloma cell lines. Because pure populations of hybridoma cells and not unfused 
immortalized cells are preferred, the cells after fusion are grown in a suitable medium that 
contains one or more substances that inhibit the growth or survival of unfused, 
immortalized cells. A common technique uses parental cells that lack the enzyme 
hypoxanthine guanine phosphoribosyl transferase (HGPRT or HPRT). In this case, 
hypoxanthine, aminopterin and thymidine are added to the medium (HAT medium) to 
prevent the growth of HGPRT-deficient cells while permitting hybridomas to grow. 

Preferred immortalized cells fuse efficiently, can be isolated from mixed 
populations by selecting in a medium such as HAT, and support stable and high-level 
expression of antibody after fusion. Preferred immortalized cell lines are murine 
myeloma lines, available from the American Type Culture Collection (Manassas, VA). 
Human myeloma and mouse-human heteromyeloma cell lines also have been described 
for the production of human mAbs (Kozbor et ai, 1984; Schook, 1987). 

Because hybridoma cells secrete antibody extracellularly, the culture media can be 
assayed for the presence of mAbs directed against an AAP (anti-AAP mAbs). 
Immunoprecipitation or in vitro binding assays, such as radio immunoassay (RIA) or 
enzyme-linked immunoabsorbent assay (ELISA), measure the binding specificity of 
mAbs (Harlow and Lane, 1988; Harlow and Lane, 1999), including Scatchard analysis 
(Munson and Rodbard, 1980). 

Anti-AAP mAb secreting hybridoma cells may be isolated as single clones by 
limiting dilution procedures and sub-cultured (Goding, 1996). Suitable culture media 
include Dulbecco's Modified Eagle's Medium, RPMI-1640, or if desired, a protein-free 
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or -reduced or serum -free medium (e.g., Ultra DOMA PF or HL-1; Biowhittaker; 
Walkersville, MD). The hybridoma cells may also be grown in vivo as ascites. 

The mAbs may be isolated or purified from the culture medium or ascites fluid by 
conventional Ig purification procedures such as protein A-Sepharose, hydroxylapatite 
chromatography, gel electrophoresis, dialysis, ammonium sulfate precipitation or affinity 
chromatography (Harlow and Lane, 1988; Harlow and Lane, 1999). 

The mAbs may also be made by recombinant methods (U.S. Patent No. 4166452, 
1979). DNA encoding anti-AAP mAbs can be readily isolated and sequenced using 
conventional procedures, e.g., using oligonucleotide probes that specifically bind to 
murine heavy and light antibody chain genes, to probe preferably DNA isolated from 
anti-AAP-secreting mAb hybridoma cell lines. Once isolated, the isolated DNA 
fragments are sub-cloned into expression vectors that are then transfected into host cells 
such as simian COS-7 cells, Chinese hamster ovary (CHO) cells, or myeloma cells that do 
not otherwise produce Ig protein, to express mAbs. The isolated DNA fragments can be 
modified, for example, by substituting the coding sequence for human heavy and light 
chain constant domains in place of the homologous murine sequences (U.S. Patent No. 
4816567, 1989; Morrison et ah, 1987), or by fusing the Ig coding sequence to all or part 
of the coding sequence for a non-Ig polypeptide. Such a non-Ig polypeptide can be 
substituted for the constant domains of an antibody, or can be substituted for the variable 
domains of one antigen-combining site to create a chimeric bivalent antibody. 

3 . Monovalent Abs 

The Abs may be monovalent Abs that consequently do not cross-link with each 
other. For example, one method involves recombinant expression of Ig light chain and 
modified heavy chain. Heavy chain truncations generally at any point in the F c region 
will prevent heavy chain cross-linking. Alternatively, the relevant cysteine residues are 
substituted with another amino acid residue or are deleted, preventing crosslinking. In 
vitro methods are also suitable for preparing monovalent Abs. Abs can be digested to 
produce fragments, such as F a b fragments (Harlow and Lane, 1988; Harlow and Lane, 
1999). 

4. Humanized and human Abs 

Anti-AAP Abs may further comprise humanized or human Abs. Humanized 
forms of non-human Abs are chimeric Igs, Ig chains or fragments (such as F v , F ab , F ab s 
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F ( ab')2 or other antigen-binding subsequences of Abs) that contain minimal sequence 
derived from non-human Ig. 

Generally, a humanized antibody has one or more amino acid residues introduced 
from a non-human source. These non-human amino acid residues are often referred to as 
"import" residues, which are typically taken from an "import" variable domain. 
Humanization is accomplished by substituting rodent CDRs or CDR sequences for the 
corresponding sequences of a human antibody (Jones et al, 1986; Riechmann et al, 
1988; Verhoeyen et al, 1988). Such "humanized" Abs are chimeric Abs (U.S. Patent No. 
4816567, 1989), wherein substantially less than an intact human variable domain has 
been substituted by the corresponding sequence from a non-human species. In practice, 
humanized Abs are typically human Abs in which some CDR residues and possibly some 
FR residues are substituted by residues from analogous sites in rodent Abs. Humanized 
Abs include human Igs (recipient antibody) in which residues from a complementary 
determining region (CDR) of the recipient are replaced by residues from a CDR of a non- 
human species (donor antibody) such as mouse, rat or rabbit, having the desired 
specificity, affinity and capacity. In some instances, corresponding non-human residues 
replace F v framework residues of the human Ig. Humanized Abs may comprise residues 
that are found neither in the recipient antibody nor in the imported CDR or framework 
sequences. In general, the humanized antibody comprises substantially all of at least one, 
and typically two, variable domains, in which most if not all of the CDR regions 
correspond to those of a non-human Ig and most if not all of the FR regions are those of a 
human Ig consensus sequence. The humanized antibody optimally also comprises at least 
a portion of an Ig constant region (F c ), typically that of a human Ig (Jones et al, 1986; 
Presta, 1992; Riechmann et al, 1988). 

Human Abs can also be produced using various techniques, including phage 
display libraries (Hoogenboom et al, 1991; Marks et al, 1991) and the preparation of 
human mAbs (Boerner et al, 1991; Reisfeld and Sell, 1985). Similarly, introducing 
human Ig genes into transgenic animals in which the endogenous Ig genes have been 
partially or completely inactivated can be exploited to synthesize human Abs. Upon 
challenge, human antibody production is observed, which closely resembles that seen in 
humans in all respects, including gene rearrangement, assembly, and antibody repertoire 
(U.S. Patent No. 5545807, 1996; U.S. Patent No. 5545806, 1996; U.S. Patent No. 
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5569825, 1996; U.S. Patent No. 5633425, 1997; U.S. Patent No. 5661016, 1997; U.S. 
Patent No. 5625126, 1997; Fishwild et al, 1996; Lonberg and Huszar, 1995; Lonberg et 
al, 1994; Marks et al, 1992). 

5. Bi-specific mAbs 

Bi-specific Abs are monoclonal, preferably human or humanized, that have 
binding specificities for at least two different antigens. For example, a binding specificity 
is an AAP; the other is for any antigen of choice, preferably a cell-surface protein or 
receptor or receptor subunit. 

Traditionally, the recombinant production of bi-specific Abs is based on the co- 
expression of two Ig heavy-chain/light-chain pairs, where the two heavy chains have 
different specificities (Milstein and Cuello, 1983). Because of the random assortment of 
Ig heavy and light chains, the resulting hybridomas (quadromas) produce a potential 
mixture often different antibody molecules, of which only one has the desired bi-specific 
structure. The desired antibody can be purified using affinity chromatography or other 
techniques (WO 93/08829, 1993; Traunecker et al, 1991). 

To manufacture a bi-specific antibody (Suresh et al, 1986), variable domains with 
the desired antibody-antigen combining sites are fused to Ig constant domain sequences. 
The fusion is preferably with an Ig heavy-chain constant domain, comprising at least part 
of the hinge, CH2, and CH3 regions. Preferably, the first heavy-chain constant region 
(CHI) containing the site necessary for light-chain binding is in at least one of the 
fusions. DNAs encoding the Ig heavy-chain fusions and, if desired, the Ig light chain, are 
inserted into separate expression vectors and are co-transfected into a suitable host 
organism. 

The interface between a pair of antibody molecules can be engineered to 
maximize the percentage of heterodimers that are recovered from recombinant cell 
culture (WO 96/27011, 1996). The preferred interface comprises at least part of the CH3 
region of an antibody constant domain. In this method, one or more small amino acid 
side chains from the interface of the first antibody molecule are replaced with larger side 
chains {e.g. tyrosine or tryptophan). Compensatory "cavities" of identical or similar size 
to the large side chain(s) are created on the interface of the second antibody molecule by 
replacing large amino acid side chains with smaller ones {e.g. alanine or threonine). This 
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mechanism increases the yield of the heterodimer over unwanted end products such as 
homodimers. 

Bi-specific Abs can be prepared as full length Abs or antibody fragments (e.g. 
F ( ab')2 bi-specific Abs). One technique to generate bi-specific Abs exploits chemical 
linkage. Intact Abs can be proteolytically cleaved to generate F (ab > )2 fragments (Brennan 
et al, 1985). Fragments are reduced with a dithiol complexing agent, such as sodium 
arsenite, to stabilize vicinal dithiols and prevent intermolecular disulfide formation. The 
generated F ab > fragments are then converted to thionitrobenzoate (TNB) derivatives. One 
of the F a b-TNB derivatives is then reconverted to the F ab -thiol by reduction with 
mercaptoethylamine and is mixed with an equimolar amount of the other F ab -TNB 
derivative to form the bi-specific antibody. The produced bi-specific Abs can be used as 
agents for the selective immobilization of enzymes. 

Fab' fragments may be directly recovered from E. coli and chemically coupled to 
form bi-specific Abs. For example, fully humanized bi-specific F( a v )2 Abs can be 
produced (Shalaby et al, 1992). Each F ab > fragment is separately secreted from E. coli 
and directly coupled chemically in vitro, forming the bi-specific antibody. 

Various techniques for making and isolating bi-specific antibody fragments 
directly from recombinant cell culture have also been described. For example, leucine 
zipper motifs can be exploited (Kostelny et al, 1992). Peptides from the Fos and Jun 
proteins are linked to the F ab - portions of two different Abs by gene fusion. The antibody 
homodimers are reduced at the hinge region to form monomers and then re-oxidized to 
form antibody heterodimers. This method can also produce antibody homodimers. The 
"diabody" technology (Holliger et al, 1993) provides an alternative method to generate 
bi-specific antibody fragments. The fragments comprise a heavy-chain variable domain 
(V H ) connected to a light-chain variable domain (V L ) by a linker that is too short to allow 
pairing between the two domains on the same chain. The V H and V L domains of one 
fragment are forced to pair with the complementary V L and V H domains of another 
fragment, forming two antigen-binding sites. Another strategy for making bi-specific 
antibody fragments is the use of single-chain F v (sF v ) dimers (Gruber et al, 1994). Abs 
with more than two valencies are also contemplated, such as tri-specific Abs (Tutt et al, 
1991). 
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Exemplary bi-specific Abs may bind to two different epitopes on a given AAP. 
Alternatively, cellular defense mechanisms can be restricted to a particular cell expressing 
the particular AAP: an anti-AAP arm may be combined with an arm that binds to a 
leukocyte triggering molecule, such as a T-cell receptor molecule (e.g. CD2, CD3, CD28, 
or B7), or to F c receptors for IgG (F c yR), such as F c yRI (CD64), F c yRII (CD32) and 
F c yRni (CD 16). Bi-specific Abs may also be used to target cytotoxic agents to cells that 
express a particular AAP. These Abs possess an AAP-binding arm and an arm that binds 
a cytotoxic agent or a radionuclide chelator. 

6. Heteroconjugate Abs 

Heteroconjugate Abs, consisting of two covalently joined Abs, have been 
proposed to target immune system cells to unwanted cells (4,676,980, 1987) and for 
treatment of human immunodeficiency virus (HIV) infection (WO 91/00360, 1991; WO 
92/20373, 1992). Abs prepared in vitro using synthetic protein chemistry methods, 
including those involving cross-linking agents, are contemplated. For example, 
immunotoxins may be constructed using a disulfide exchange reaction or by forming a 
thioether bond. Examples of suitable reagents include iminothiolate and methyl-4- 
mercaptobutyrimidate (4,676,980, 1987). 

7. Immunoconjugates 

Immunoconjugates may comprise an antibody conjugated to a cytotoxic agent 
such as a chemotherapeutic agent, toxin {e.g., an enzymatically active toxin or fragment 
of bacterial, fungal, plant, or animal origin), or a radioactive isotope (i.e., a 
radioconjugate). 

Useful enzymatically-active toxins and fragments include Diphtheria A chain, 
non-binding active fragments of Diphtheria toxin, exotoxin A chain from Pseudomonas 
aeruginosa, ricin A chain, abrin A chain, modeccin A chain, a-sarcin, Aleurites fordii 
proteins, Dianthin proteins, Phytolaca americana proteins, Momordica charantia 
inhibitor, curcin, crotin, Sapaonaria officinalis inhibitor, gelonin, mitogellin, restrictocin, 
phenomycin, enomycin, and the tricothecenes. A variety of radionuclides are available 
for the production of radioconjugated Abs, such as 212 Bi, 131 1, 131 In, 90 Y, and 186 Re. 

Conjugates of the antibody and cytotoxic agent are made using a variety of bi- 
functional protein-coupling agents, such as N-succinimidyl-3-(2-pyridyldithiol) 
propionate (SPDP), iminothiolane (IT), bi-functional derivatives of imidoesters (such as 
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dimethyl adipimidate HC1), active esters (such as disuccinimidyl suberate), aldehydes 
(such as glutareldehyde), bis-azido compounds (such as bis (p-azidobenzoyl) 
hexanediamine), fo's-diazonium derivatives (such as bis-(p-diazoniumbenzoyl)- 
ethylenediamine), diisocyanates (such as tolyene 2,6- diisocyanate), and frw-active 
fluorine compounds (such as l,5-difluoro-2,4-dinitrobenzene). For example, a ricin 
immunotoxin can be prepared (Vitetta et al, 1987). 14 C-labeled 1-isothiocyanatobenzyl- 
3-methyldiethylene triaminepentaacetic acid (MX-DTPA) is an exemplary chelating 
agent for conjugating radionuclide to antibody (WO 94/1 1026, 1994). 

In another embodiment, the antibody may be conjugated to a "receptor" (such as 
streptavidin) for utilization in tumor pre-targeting wherein the antibody-receptor 
conjugate is administered to the patient, followed by removal of unbound conjugate from 
the circulation using a clearing agent and then administration of a streptavidin "ligand" 
{e.g., biotin) that is conjugated to a cytotoxic agent {e.g., a radionuclide). 

8. Effector function engineering 

The antibody can be modified to enhance its effectiveness in treating a disease, 
such as cancer. For example, cysteine residue(s) may be introduced into the F c region, 
thereby allowing interchain disulfide bond formation in this region. Such homodimeric 
Abs may have improved internalization capability and/or increased complement-mediated 
cell killing and antibody-dependent cellular cytotoxicity (ADCC) (Caron et al, 1992; 
Shopes, 1992). Homodimeric Abs with enhanced anti-tumor activity can be prepared 
using hetero-bifunctional cross-linkers (Wolff et al, 1993). Alternatively, an antibody 
engineered with dual F c regions may have enhanced complement lysis (Stevenson et al, 
1989). 

9. Immunoliposomes 

Liposomes containing the antibody may also be formulated (U.S. Patent No. 
4485045, 1984; U.S. Patent No. 4544545, 1985; U.S. Patent No. 5013556, 1991; Eppstein 
et al, 1985; Hwang et al, 1980). Useful liposomes can be generated by a reverse-phase 
evaporation method with a lipid composition comprising phosphatidylcholine, 
cholesterol, and PEG-derivatized phosphatidylethanolamine (PEG- PE). Such 
preparations are extruded through filters of defined pore size to yield liposomes with a 
desired diameter. F ab - fragments of the antibody can be conjugated to the liposomes 
(Martin and Papahadjopoulos, 1982) via a disulfi de-interchange reaction. A 
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chemotherapeutic agent, such as Doxorubicin, may also be contained in the liposome 
(Gabizon et ah, 1989). Other useful liposomes with different compositions are 
contemplated. 

10. Diagnostic applications ofAbs directed against an AAP 

Anti-AAP Abs can be used to localize and/or quantitate anAAP {e.g. , for use in 
measuring levels of an AAP within tissue samples or for use in diagnostic methods, etc.). 
Anti-AAP epitope Abs can be utilized as pharmacologically-active compounds. 

Anti-AAP Abs can be used to isolate an AAP by standard techniques, such as 
immunoaffinity chromatography or immunoprecipitation. These approaches facilitate 
purifying an endogenous AAP antigen-containing polypeptides from cells and tissues. 
These approaches, as well as others, can be used to detect an AAP in a sample to evaluate 
the abundance and pattern of expression of the antigenic protein. Anti-AAP Abs can be 
used to monitor protein levels in tissues as part of a clinical testing procedure; for 
example, to determine the efficacy of a given treatment regimen. Coupling the antibody 
to a detectable substance (label) allows detection of Ab-antigen complexes. Classes of 
labels include fluorescent, luminescent, bioluminescent, and radioactive materials, 
enzymes and prosthetic groups. Useful labels include horseradish peroxidase, alkaline 
phosphatase, [3-galactosidase, acetylcholinesterase, streptavidin/biotin, avidin/biotin, 
umbelliferone, fluorescein, fluorescein isothiocyanate, rhodamine, dichlorotriazinylamine 
fluorescein, dansyl chloride, phycoerythrin, luminol, luciferase, luciferin, aequorin, and 
125 1, 131 1, 35 Sor 3 H. 

1 1 . Antibody therapeutics 

Abs of the invention, including polyclonal, monoclonal, humanized and fully 
human Abs, can be used therapeutically. Such agents will generally be employed to treat 
or prevent a disease or pathology in a subject. An antibody preparation, preferably one 
having high antigen specificity and affinity generally mediates an effect by binding the 
target epitope(s). Generally, administration of such Abs may mediate one of two effects: 
(1) the antibody may prevent ligand binding, eliminating endogenous ligand binding and 
subsequent signal transduction, or (2) the antibody elicits a physiological result by 
binding an effector site on the target molecule, initiating signal transduction. 

A therapeutically effective amount of an antibody relates generally to the amount 
needed to achieve a therapeutic objective, epitope binding affinity, administration rate, 
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and depletion rate of the antibody from a subject. Common ranges for therapeutically 
effective doses may be, as a nonlimiting example, from about 0.1 mg/kg body weight to 
about 50 mg/kg body weight. Dosing frequencies may range, for example, from twice 
daily to once a week. 

12. Pharmaceutical compositions ofAbs 

Anti-AAP Abs, as well as other AAP interacting molecules (such as aptamers) 
identified in other assays, can be administered in pharmaceutical compositions to treat 
various disorders. Principles and considerations involved in preparing such compositions, 
as well as guidance in the choice of components can be found in (de Boer, 1994; 
Gennaro, 2000; Lee, 1990). 

Since some AAP are intracellular, Abs that are internalized are preferred used 
when whole Abs are used as inhibitors. Liposomes may also be used as a delivery 
vehicle for intracellular introduction. Where antibody fragments are used, the smallest 
inhibitory fragment that specifically binds to the epitope is preferred. For example, 
peptide molecules can be designed that bind a preferred epitope based on the variable- 
region sequences of a useful antibody. Such peptides can be synthesized chemically 
and/or produced by recombinant DNA technology (Marasco et al, 1993). Formulations 
may also contain more than one active compound for a particular treatment, preferably 
those with activities that do not adversely affect each other. The composition may 
comprise an agent that enhances function, such as a cytotoxic agent, cytokine, 
chemotherapeutic agent, or growth-inhibitory agent. 

The active ingredients can also be entrapped in microcapsules prepared by 
coacervation techniques or by interfacial polymerization; for example, 
hydroxymethylcellulose or gelatin-microcapsules and poly-(methylmethacrylate) 
microcapsules, respectively, in colloidal drug delivery systems (for example, liposomes, 
albumin microspheres, microemulsions, nano-particles, and nanocapsules) or in 
macroemulsions. 

The formulations to be used for in vivo administration are highly preferred to be 
sterile. This is readily accomplished by filtration through sterile filtration membranes or 
any of a number of techniques. 

Sustained-release preparations may also be prepared, such as semi-permeable 
matrices of solid hydrophobic polymers containing the antibody, which matrices are in 
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the form of shaped articles, e.g., films, or microcapsules. Examples of sustained-release 
matrices include polyesters, hydrogels (for example, poly (2-hydroxyethyl-methacryl ate), 
or poly(vinylalcohol)), polylactides (Boswell and Scribner, U.S. Patent No. 3,773,919, 
1973), copolymers of L-glutamic acid and y ethyl-L-glutamate, non-degradable ethylene- 
vinyl acetate, degradable lactic acid-glycolic acid copolymers such as injectable 
microspheres composed of lactic acid-glycolic acid copolymer, and poly-D-(-)-3- 
hydroxybutyric acid. While polymers such as ethylene-vinyl acetate and lactic acid- 
glycolic acid enable release of molecules for over 100 days, certain hydrogels release 
proteins for shorter time periods and may be preferred. 

AAP recombinant expression vectors and host cells 

Vectors are tools used to shuttle DNA between host cells or as a means to express 
a nucleotide sequence. Some vectors function only in prokaryotes, while others function 
in both prokaryotes and eukaryotes, enabling large-scale DNA preparation from 
prokaryotes for expression in eukaryotes. Inserting the DNA of interest, such as an AAP 
nucleotide sequence or a fragment, is accomplished by ligation techniques and/or mating 
protocols well-known to the skilled artisan. Such DNA is inserted such that its 
integration does not disrupt any necessary components of the vector. In the case of 
vectors that are used to express the inserted DNA protein, the introduced DNA is 
operably-linked to the vector elements that govern its transcription and translation. 

Vectors can be divided into two general classes: Cloning vectors are replicating 
plasmid or phage with regions that are non-essential for propagation in an appropriate 
host cell, and into which foreign DNA can be inserted; the foreign DNA is replicated and 
propagated as if it were a component of the vector. An expression vector (such as a 
plasmid, yeast, or animal virus genome) is used to introduce foreign genetic material into 
a host cell or tissue in order to transcribe and translate the foreign DNA. In expression 
vectors, the introduced DNA is operably-linked to elements, such as promoters, that 
signal to the host cell to transcribe the inserted DNA. Some promoters are exceptionally 
useful, such as inducible promoters that control gene transcription in response to specific 
factors. Operably-linking an AAP or anti-sense construct to an inducible promoter can 
control the expression of an AAP or fragments, or anti-sense constructs. Examples of 
classic inducible promoters include those that are responsive to a-interferon, heat-shock, 
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heavy metal ions, and steroids such as glucocorticoids (Kaufman, 1990) and tetracycline. 
Other desirable inducible promoters include those that are not endogenous to the cells in 
which the construct is being introduced, but, however, is responsive in those cells when 
the induction agent is exogenously supplied. 

Vectors have many difference manifestations. A "plasmid" is a circular double 
stranded DNA molecule into which additional DNA segments can be introduced. Viral 
vectors can accept additional DNA segments into the viral genome. Certain vectors are 
capable of autonomous replication in a host cell (e.g., bacterial vectors having a bacterial 
origin of replication and episomal mammalian vectors). Other vectors (e.g., 
non-episomal mammalian vectors) are integrated into the genome of a host cell upon 
introduction into the host cell, and thereby are replicated along with the host genome. In 
general, useful expression vectors are often plasmids. However, other forms of 
expression vectors, such as viral vectors (e.g., replication defective retroviruses, 
adenoviruses and adeno-associated viruses) are contemplated. 

Recombinant expression vectors that comprise an AAP (or fragments) regulate an 
AAP transcription by exploiting one or more host cell-responsive (or that can be 
manipulated in vitro) regulatory sequences that is operably-linked to an AAP. "Operably- 
linked" indicates that a nucleotide sequence of interest is linked to regulatory sequences 
such that expression of the nucleotide sequence is achieved. 

Vectors can be introduced in a variety of organisms and/or cells (Table D). 
Alternatively, the vectors can be transcribed and translated in vitro, for example using T7 
promoter regulatory sequences and T7 polymerase. 



Table D Examples of hosts for cloning or expression 



Organisms 


Examples 


Sources and References* 


Prokaryotes 






Enterobacteriaceae 


E. coli 






K 12 strain MM294 


ATCC 31,446 




X1776 


ATCC 31,537 




W3110 


ATCC 27,325 




K5 772 


ATCC 53,635 




Enterobacter 






Erwinia 






Klebsiella 






Proteus 
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Table D Examples of hosts for cloning or expression 



Organisms 


Examples 


Sources and References* 




Salmonella (S. tyhpimurium) 




Serratia (S. marcescans) 




Shigella 




Bacilli (B. subtilis and B. 
licheniformis) 




Pseudomonas (P. aeruginosa) 




Streptomyces 




Eukaryotes 


Saccharomyces cerevisiae 




Yeasts 


Schizosaccharomyces pombe 




Kluyveromyces 
K. lactis MW98-8C, 
CBS683, CBS4574 
K. fragilis 
K. bulgaricus 
K. wickeramii 
K. waltii 

K. drosophilarum 
K. thermotolerans 
K. marxianus; yarrowia 


(Fleer etal, 1991) 

(de Louvencourt et al., 1983) 

ATCC 12,424 
ATCC 16,045 
ATCC 24,178 
ATCC 56,500 
ATCC 36,906 

(EPO 402226, 1990) 


Pichia pastoris 


(Sreekrishnaez'tf/., 1988) 


Candida 




Trichoderma reesia 




Neurospora crassa 


(Case et al, 1979) 


Torulopsis 




Rhodotorula 




Schwanniomyces (S. 
occidentalis) 




Filamentous Fungi 


Neurospora 




Penicillium 




Tolypocladium 


(WO 91/00357, 1991) 


Aspergillus (A. nidulans and 
A. niger) 


(Kelly andHynes, 1985; Tilburn 
et al, 1983; Yelton et al, 1984) 


Invertebrate cells 


Drosophila S2 




Spodoptera Sf9 




Vertebrate cells 


Chinese Hamster Ovary 
(CHO) 




simian COS 
COS-7 


ATCC CRL 1651 


HEK293 




^Unreferenced cells are generally available from American Type Culture Collection 
(Manassas, VA). 
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Vector choice is dictated by the organism or cells being used and the desired fate 
of the vector. Vectors may replicate once in the target cells, or may be "suicide" vectors. 
In general, vectors comprise signal sequences, origins of replication, marker genes, 
enhancer elements, promoters, and transcription termination sequences. The choice of 
these elements depends on the organisms in which the vector will be used and are easily 
determined. Some of these elements may be conditional, such as an inducible or 
conditional promoter that is turned "on" when conditions are appropriate. Examples of 
inducible promoters include those that are tissue-specific, which relegate expression to 
certain cell types, steroid-responsive, or heat-shock reactive. Some bacterial repression 
systems, such as the lac operon, have been exploited in mammalian cells and transgenic 
animals (Fieck et al, 1992; Wyborski et al, 1996; Wyborski and Short, 1991). Vectors 
often use a selectable marker to facilitate identifying those cells that have incorporated 
the vector. Many selectable markers are well known in the art for the use with 
prokaryotes, usually antibiotic-resistance genes or the use of autotrophy and auxotrophy 
mutants. 

Using antisense and sense AAP oligonucleotides can prevent an AAP polypeptide 
expression. These oligonucleotides bind to target nucleic acid sequences, forming 
duplexes that block transcription or translation of the target sequence by enhancing 
degradation of the duplexes, terminating prematurely transcription or translation, or by 
other means. 

Antisense or sense oligonucleotides are singe-stranded nucleic acids, either RNA 
or DNA, which can bind a target AAP mRNA (sense) or an AAP DNA (antisense) 
sequences. According to the present invention, antisense or sense oligonucleotides 
comprise a fragment of an AAP DNA coding region of at least about 14 nucleotides, 
preferably from about 14 to 30 nucleotides. In general, antisense RNA or DNA 
molecules can comprise at least 5, 10, 15, 20, 25, 30, 35, 40, 45, 50, 55, 60. 65, 70, 75, 
80, 85, 90, 95, 100 bases in length or more. Among others, (Stein and Cohen, 1988; van 
der Krol et al, 1988a) describe methods to derive antisense or a sense oligonucleotides 
from a given cDNA sequence. 

Modifications of antisense and sense oligonucleotides can augment their 
effectiveness. Modified sugar-phosphodiester bonds or other sugar linkages (WO 
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91/06629, 1991), increase in vivo stability by conferring resistance to endogenous 
nucleases without disrupting binding specificity to target sequences. Other modifications 
can increase the affinities of the oligonucleotides for their targets, such as covalently 
linked organic moieties (WO 90/10448, 1990) or poly-(L)-lysine. Other attachments 
modify binding specificities of the oligonucleotides for their targets, including metal 
complexes or intercalating (e.g. ellipticine) and alkylating agents. 

To introduce antisense or sense oligonucleotides into target cells (cells containing 
the target nucleic acid sequence), any gene transfer method may be used and are well 
known to those of skill in the art. Examples of gene transfer methods include 1) 
biological, such as gene transfer vectors like Epstein-Barr virus or conjugating the 
exogenous DNA to a ligand-binding molecule (WO 91/04753, 1991), 2) physical, such as 
electroporation, and 3) chemical, such as CaP0 4 precipitation and oligonucleotide-lipid 
complexes (WO 90/10448, 1990). 

The terms "host cell" and "recombinant host cell" are used interchangeably. Such 
terms refer not only to a particular subject cell but also to the progeny or potential 
progeny of such a cell. Because certain modifications may occur in succeeding 
generations due to either mutation or environmental influences, such progeny may not, in 
fact, be identical to the parent cell, but are still included within the scope of the term. 

Methods of eukaryotic cell transfection and prokaryotic cell transformation are 
well known in the art. The choice of host cell will dictate the preferred technique for 
introducing the nucleic acid of interest. Table E, which is not meant to be limiting, 
summarizes many of the known techniques in the art. Introduction of nucleic acids into 
an organism may also be done with ex vivo techniques that use an in vitro method of 
transfection, as well as established genetic techniques, if any, for that particular organism. 



Table E Methods to introduce nucleic acid into cells 



Cells 


Methods 


References 


Notes 


Prokaryotes 
(bacteria) 


Calcium chloride 


(Cohen et al, 1972; 
Hanahan, 1983; Mandel and 
Higa, 1970) 




Electroporation 


(Shigekawa and Dower, 
1988) 




Eukaryotes 
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Table E Methods to introduce nucleic acid into cells 



Cells 


Methods 


References 


Notes 


Mammalian 
cells 


Calcium phosphate 
transfection 


N-{2~ 

Hydroxyethyl)piperazine-iV' - 
(2-ethanesulfonic acid 
(HEPES) buffered saline 
solution (Chen and 
Okayama, 1988; Graham and 
van der Eb, 1973; Wigler et 
al, 1978) 

BES (N,N-bis(2- 
hydroxyethyl)-2- 
aminoethanesulfonic acid) 
buffered solution (Ishiura et 
al, 1982) 


Cells may be 
"shocked" with 
glycerol or 
dimethylsulf oxi de 
(DMSO) to increase 
transfection 
efficiency (Ausubel 
etal, 1987). 


Diethylaminoethyl 

\ UCjt\C J -L/C A II al 1 

transfection 


(Fujita et al, 1986; Lopata et 
ai., lyo^, oeicien et uL, 
1986) 


Most useful for 
transient, but not 
stable, transfections. 
Chloroquine can be 
used to increase 
efficiency. 


Electroporation 


(Neumann et al, 1982; 
Potter, 1988; Potter et al, 
1984; Wong and Neumann, 
1982) 


Especially useful for 

hard-to-transfect 

lymphocytes. 


Cationic lipid 

reagent 

transfection 


(Elroy-Stein and Moss, 1990; 
Feigner et al, 1987; Rose et 
al, 1991; Whitt et al, 1990) 


Applicable to both 
in vivo and in vitro 
transfection. 


Retroviral 


Production exemplified by 
(Cepko et al, 1984; Miller 
and Buttimore, 1986; Pear et 
al, 1993) 

Infection in vitro and in vivo: 
(Austin and Cepko, 1990; 
Bodine et al, 1991; Fekete 
and Cepko, 1993; Lemischka 
et al, 1986; Turner et al, 
1990; Williams et al, 1984) 


Lengthy process, 
many packaging 
lines available at 
ATCC. Applicable 
to both in vivo and 
in vitro transfection. 


Polybrene 


(Chaney et al, 1986; Kawai 
and Nishizawa, 1984) 




Microinjection 


(Capecchi, 1980) 


Can be used to 
establish cell lines 
carrying integrated 
copies of AAP DNA 
sequences. 
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Table E Methods to introduce nucleic acid into cells 



Cells 


Methods 


References 


Notes 




Protoplast fusion 


(Rassoulzadegan et al, 1982; 
Sandri-Goldin et al, 1981; 
Schaffner, 1980) 




Insect cells 
(in vitro) 


Baculovirus 
systems 


(Luckow, 1991; Miller, 
1988; O'Reilly etal, 1992) 


Useful for in vitro 
production of 
proteins with 
eukaryotic 
modifications. 


Yeast 


Electroporation 


(Becker and Guarente, 1991) 




Lithium itcctslc 


(Gietz et al, 1998; Ito et al, 
1983) 




Spheroplast fusion 


(Beggs, 1978; Hinnen et al, 
1978) 


Laborious, can 
produce aneuploids. 


Plant cells 
(general 
reference: 
(Hansen and 
Wright, 
1999)) 


Agrobacterium 
transformation 


(Bechtold and Pelletier, 
1998; Escudero and Hohn, 
1997; Hansen and Chilton, 
1999; Touraev and al., 1997) 




Biolistics 
(microprojectiles) 


(Finer et al, 1999; Hansen 
and Chilton, 1999; Shillito, 
1999) 




Electroporation 
(protoplasts) 


(Fromm et al, 1985; Ou-Lee 
etal, 1986; Rhodes etal, 
1988; Saunders et al, 1989) 
May be combined with 
liposomes (Trick and al., 
1997) 




Polyethylene 
glycol (PEG) 
treatment 


(Shillito, 1999) 




Liposomes 


May be combined with 
electroporation (Trick and 
al., 1997) 




in planta 
microinjection 


(Leduc and al., 1996; Zhou 
and al., 1983) 




Seed imbibition 


(Trick and al., 1997) 




Laser beam 


(Hoffman, 1996) 




Silicon carbide 
whiskers 


(Thompson and al., 1995) 





Vectors often use a selectable marker to facilitate identifying those cells that have 
incorporated the vector. Many selectable markers are well known in the art for the use 
with prokaryotes, usually antibiotic-resistance genes or the use of autotrophy and 
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auxotrophy mutants. Table F lists often-used selectable markers for mammalian cell 
transfection. 



Table F Useful selectable markers for eukaryote cell transfection 



Selectable Marker 


Selection 


Action 


Reference 


Adenosine deaminase 
(ADA) 


Media includes 9-0-D- 
xylofuranosyl adenine 
(Xyl-A) 


Conversion of Xyl-A to 
Xyl-ATP, which 
incorporates into 
nucleic acids, killing 
cells. ADA detoxifies 


(Kaufman et 
al, 1986) 


Dihydrofolate 
reductase (DHFR) 


Methotrexate (MTX) 
and dialyzed serum 
(purine-free media) 


MTX competitive 
inhibitor of DHFR. In 
absence of exogenous 
purines, cells require 
DHFR, a necessary 
enzyme in purine 
biosynthesis. 


(Simonsen 
and 

Le Vinson, 
1983) 


Aminoglycoside 
phosphotransferase 
("APH", "neo", 
"G418") 


G418 


G418, an 
aminoglycoside 
detoxified by APH, 
interferes with 
ribosomal function and 
consequently, 
translation. 


(Southern 
and Berg, 
1982) 


Hygromycin-B - 

phosphotransferase 

(HPH) 


hygromycin-B 


Hygromycin-B, an 
aminocyclitol 
detoxified by HPH, 
disrupts protein 
translocation and 
promotes 
mistranslation. 


(Palmer et 
al, 1987) 


Thymidine kinase 
(TK) 


Forward selection 
(TK+): Media (HAT) 
incorporates 
aminopterin. 
Reverse selection (TK- 
): Media incorporates 
5 -bromodeoxyuri dine 
(BrdU). 


Forward: Aminopterin 
forces cells to synthesze 
dTTP from thymidine, a 
pathway requiring TK. 
Reverse: TK 
phosphorylates BrdU, 
which incorporates into 
nucleic acids, killing 
cells. 


(Littlefield, 
1964) 
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A host cell of the invention, such as a prokaryotic or eukaryotic host cell in 
culture, can be used to produce an AAP. Accordingly, the invention provides methods 
for producing an AAP using the host cells of the invention. In one embodiment, the 
method comprises culturing the host cell of the invention (into which a recombinant 
5 expression vector encoding an AAP has been introduced) in a suitable medium, such that 

an AAP is produced. In another embodiment, the method further comprises isolating an 
AAP from the medium or the host cell. 

Transgenic AAP animals 

Transgenic animals are useful for studying the function and/or activity of an AAP 

0 and for identifying and/or evaluating modulators of AAP activity. "Transgenic animals" 

are non-human animals, preferably mammals, more preferably a rodents such as rats or 
mice, in which one or more of the cells include a transgene. Other transgenic animals 
include primates, sheep, dogs, cows, goats, chickens, amphibians, etc. A "transgene" is 
exogenous DNA that is integrated into the genome of a cell from which a transgenic 

5 animal develops, and that remains in the genome of the mature animal. Transgenes 

preferably direct the expression of an encoded gene product in one or more cell types or 
tissues of the transgenic animal with the purpose of preventing expression of a naturally 
encoded gene product in one or more cell types or tissues (a "knockout" transgenic 
animal), or serving as a marker or indicator of an integration, chromosomal location, or 

0 region of recombination (e.g. cre/loxP mice). A "homologous recombinant animal" is a 

non-human animal, such as a rodent, in which an endogenous AAP has been altered by an 
exogenous DNA molecule that recombines homologously with an endogenous AAP in a 
(e.g. embryonic) cell prior to development the animal. Host cells with an exogenous AAP 
can be used to produce non-human transgenic animals, such as fertilized oocytes or 

5 embryonic stem cells into which an AAF-coding sequences have been introduced. Such 

host cells can then be used to create non-human transgenic animals or homologous 
recombinant animals. 

1 . Approaches to transgenic animal production 

A transgenic animal can be created by introducing an AAP into the male pronuclei 
) of a fertilized oocyte (e.g., by microinjection, retroviral infection) and allowing the oocyte 

to develop in a pseudopregnant female foster animal (pffa). An AAP cDNA sequences 
(SEQ ID NO:l, 3, 5, 7, 9, 11, 13 or 15) can be introduced as a transgene into the genome 
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of a non-human animal. Alternatively, a homologue of an AAP, such as the naturally- 
occuring variant of an AAP, can be used as a transgene. Intronic sequences and 
polyadenylation signals can also be included in the transgene to increase transgene 
expression. Tissue-specific regulatory sequences can be operably-linked to the AAP 
transgene to direct expression of the AAP to particular cells. Methods for generating 
transgenic animals via embryo manipulation and microinjection, particularly animals such 
as mice, have become conventional in the art, e.g. (Evans et al., U.S. Patent No. 
4,870,009, 1989; Hogan, 0879693843, 1994; Leder and Stewart, U.S. Patent No. 
4,736,866, 1988; Wagner and Hoppe, US Patent No. 4,873,191, 1989). Other non-mice 
transgenic animals may be made by similar methods. A transgenic founder animal, which 
can be used to breed additional transgenic animals, can be identified based upon the 
presence of the transgene in its genome and/or expression of the transgene mRNA in 
tissues or cells of the animals. Transgenic animals can be bred to other transgenic 
animals carrying other transgenes. 

2. Vectors for transgenic animal production 

To create a homologous recombinant animal, a vector containing at least a portion 
of an AAP into which a deletion, addition or substitution has been introduced to thereby 
alter, e.g., disrupt or alter the expression of, an AAP. An AAP can be a murine gene, or 
other AAP homologue, such as a naturally occurring variant. In one approach, a knockout 
vector functionally disrupts an endogenous AAP gene upon homologous recombination, 
and thus a non-functional AAP protein, if any, is expressed. 

Alternatively, the vector can be designed such that, upon homologous 
recombination, an endogenous AAP is mutated or otherwise altered but still encodes 
functional protein (e.g., the upstream regulatory region can be altered to thereby alter the 
expression of an endogenous AAP). In this type of homologous recombination vector, 
the altered portion of the AAP is flanked at its 5'- and 3'-termini by additional nucleic acid 
of the AAP to allow for homologous recombination to occur between the exogenous AAP 
carried by the vector and an endogenous AAP in an embryonic stem cell. The additional 
flanking AAP nucleic acid is sufficient to engender homologous recombination with the 
endogenous AAP. Typically, several kilobases of flanking DNA (both at the 5'- and 3'- 
termini) are included in the vector (Thomas and Capecchi, 1987). The vector is then 
introduced into an embryonic stem cell line (e.g., by electroporation), and cells in which 
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the introduced AAP has homologously-recombined with an endogenous AAP are selected 
(Li etal., 1992). 

3. Introduction of an AAP trans gene cells during development 

Selected cells are then injected into a blastocyst of an animal {e.g., a mouse) to 
5 form aggregation chimeras (Bradley, 1987). A chimeric embryo can then be implanted 

into a suitable pffa and the embryo brought to term. Progeny harboring the 
homologously-recombined DNA in their germ cells can be used to breed animals in 
which all cells of the animal contain the homologously-recombined DNA by germline 
transmission of the transgene. Methods for constructing homologous recombination 
1 0 vectors and homologous recombinant animals are described (Bems et ah, WO 93/04169, 

1993; Bradley, 1991; Kucherlapati et al, WO 91/01140, 1991; Le Mouellic and Brullet, 
WO 90/11354, 1990). 

Alternatively, transgenic animals that contain selected systems that allow for 
regulated expression of the transgene can be produced. An example of such a system is 
5 the cre/loxP recombinase system of bacteriophage PI (Lakso et al., 1992). Another 

recombinase system is the FLP recombinase system of Saccharomyces cerevisiae 
(O'Gorman et al, 1991). If a creAoxP recombinase system is used to regulate expression 
of the transgene, animals containing transgenes encoding both the Cre recombinase and a 
selected protein are required. Such animals can be produced as "double" transgenic 
0 animals, by mating an animal containing a transgene encoding a selected protein to 

another containing a transgene encoding a recombinase. 

Clones of transgenic animals can also be produced (Wilmut et al, 1997). In brief, 
a cell from a transgenic animal can be isolated and induced to exit the growth cycle and 
enter G 0 phase. The quiescent cell can then be fused to an enucleated oocyte from an 
5 animal of the same species from which the quiescent cell is isolated. The reconstructed 

oocyte is then cultured to develop to a morula or blastocyte and then transferred to a pffa 
The offspring borne of this female foster animal will be a clone of the "parent" transgenic 
animal. 

Pharmaceutical compositions 
) The AAP nucleic acid molecules, AAP polypeptides, and anti-AAP Abs (active 

compounds) of the invention, and derivatives, fragments, analogs and homologs thereof, 
can be incorporated into pharmaceutical compositions. Such compositions typically 



- 103- 



comprise the nucleic acid molecule, protein, or antibody and a pharmaceutically 
acceptable carrier. A "pharmaceutically acceptable carrier" includes any and all solvents, 
dispersion media, coatings, antibacterial and antifungal agents, isotonic and absorption 
delaying agents, and the like, compatible with pharmaceutical administration (Gennaro, 
2000). Preferred examples of such carriers or diluents include, but are not limited to, 
water, saline, finger's solutions, dextrose solution, and 5% human serum albumin. 
Liposomes and non-aqueous vehicles such as fixed oils may also be used. Except when a 
conventional media or agent is incompatible with an active compound, use of these 
compositions is contemplated. Supplementary active compounds can also be 
incorporated into the compositions. 

1 . General considerations 

A pharmaceutical composition of the invention is formulated to be compatible 
with its intended route of administration, including intravenous, intradermal, 
subcutaneous, oral {e.g., inhalation), transdermal (i.e., topical), transmucosal, and rectal 
administration. Solutions or suspensions used for parenteral, intradermal, or 
subcutaneous application can include: a sterile diluent such as water for injection, saline 
solution, fixed oils, polyethylene glycols, glycerine, propylene glycol or other synthetic 
solvents; antibacterial agents such as benzyl alcohol or methyl parabens; antioxidants 
such as ascorbic acid or sodium bisulfite; chelating agents such as 

ethylenediaminetetraacetic acid (EDTA); buffers such as acetates, citrates or phosphates, 
and agents for the adjustment of tonicity such as sodium chloride or dextrose. The pH 
can be adjusted with acids or bases, such as hydrochloric acid or sodium hydroxide. The 
parenteral preparation can be enclosed in ampoules, disposable syringes or multiple dose 
vials made of glass or plastic. 

2. Injectable formulations 

Pharmaceutical compositions suitable for injection include sterile aqueous 
solutions (where water soluble) or dispersions and sterile powders for the 
extemporaneous preparation of sterile injectable solutions or dispersion. For intravenous 
administration, suitable carriers include physiological saline, bacteriostatic water, 
CREMOPHOR EL™ (BASF, Parsippany, NJ.) or phosphate buffered saline (PBS). In all 
cases, the composition must be sterile and should be fluid so as to be administered using a 
syringe. Such compositions should be stable during manufacture and storage and must be 



-104- 



preserved against contamination from microorganisms such as bacteria and fungi. The 
carrier can be a solvent or dispersion medium containing, for example, water, ethanol, 
polyol (such as glycerol, propylene glycol, and liquid polyethylene glycol), and suitable 
mixtures. Proper fluidity can be maintained, for example, by using a coating such as 
lecithin, by maintaining the required particle size in the case of dispersion and by using 
surfactants. Various antibacterial and antifungal agents, for example, parabens, 
chlorobutanol, phenol, ascorbic acid, and thimerosal, can contain microorganism 
contamination. Isotonic agents, for example, sugars, polyalcohols such as manitol, 
sorbitol, and sodium chloride can be included in the composition. Compositions that can 
delay absorption include agents such as aluminum monostearate and gelatin. 

Sterile injectable solutions can be prepared by incorporating the active compound 
(e.g., an AAP or anti-AAP antibody) in the required amount in an appropriate solvent 
with one or a combination of ingredients as required, followed by sterilization. 
Generally, dispersions are prepared by incorporating the active compound into a sterile 
vehicle that contains a basic dispersion medium, and the other required ingredients as 
discussed. Sterile powders for the preparation of sterile injectable solutions, methods of 
preparation include vacuum drying and freeze-drying that yield a powder containing the 
active ingredient and any desired ingredient from a sterile solutions. 

3. Oral compositions 

Oral compositions generally include an inert diluent or an edible carrier. They can 
be enclosed in gelatin capsules or compressed into tablets. For the purpose of oral 
therapeutic administration, the active compound can be incorporated with excipients and 
used in the form of tablets, troches, or capsules. Oral compositions can also be prepared 
using a fluid carrier for use as a mouthwash, wherein the compound in the fluid carrier is 
applied orally. Pharmaceutic ally compatible binding agents, and/or adjuvant materials 
can be included. Tablets, pills, capsules, troches and the like can contain any of the 
following ingredients, or compounds of a similar nature: a binder such as microcrystalline 
cellulose, gum tragacanth or gelatin; an excipient such as starch or lactose, a 
disintegrating agent such as alginic acid, PREVIOGEL, or corn starch; a lubricant such as 
magnesium stearate or STEROTES; a glidant such as colloidal silicon dioxide; a 
sweetening agent such as sucrose or saccharin; or a flavoring agent such as peppermint, 
methyl salicylate, or orange flavoring. 



-105- 



4. Compositions for inhalation 

For administration by inhalation, the compounds are delivered as an aerosol spray 
from a nebulizer or a pressurized container that contains a suitable propellant, e.g., a gas 
such as carbon dioxide. 

5. Systemic administration 

Systemic administration can also be transmucosal or transdermal. For 
transmucosal or transdermal administration, penetrants that can permeate the target 
barrier(s) are selected. Transmucosal penetrants include, detergents, bile salts, and 
fusidic acid derivatives. Nasal sprays or suppositories can be used for transmucosal 
administration. For transdermal administration, the active compounds are formulated into 
ointments, salves, gels, or creams. 

The compounds can also be prepared in the form of suppositories {e.g., with bases 
such as cocoa butter and other glycerides) or retention enemas for rectal delivery. 

6. Carriers 

In one embodiment, the active compounds are prepared with carriers that protect 
the compound against rapid elimination from the body, such as a controlled release 
formulation, including implants and microencapsulated delivery systems. Biodegradable, 
biocompatible polymers can be used, such as ethylene vinyl acetate, polyanhydrides, 
polyglycolic acid, collagen, polyorthoesters, and polylactic acid. Such materials can be 
obtained commercially from ALZA Corporation (Mountain View, CA) and NOVA 
Pharmaceuticals, Inc. (Lake Elsinore, CA), or prepared by one of skill in the art. 
Liposomal suspensions can also be used as pharmaceutically acceptable carriers. These 
can be prepared according to methods known to those skilled in the art, such as in 
(Eppstein etal, US Patent No. 4,522,811, 1985). 

7. Unit dosage 

Oral formulations or parenteral compositions in unit dosage form can be created to 
facilitate administration and dosage uniformity. Unit dosage form refers to physically 
discrete units suited as single dosages for the subject to be treated, containing a 
therapeutically effective quantity of active compound in association with the required 
pharmaceutical carrier. The specification for the unit dosage forms of the invention are 
dictated by, and directly dependent on, the unique characteristics of the active compound 
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and the particular desired therapeutic effect, and the inherent limitations of compounding 
the active compound. 

8. Gene therapy compositions 

The nucleic acid molecules of the invention can be inserted into vectors and used 
as gene therapy vectors. Gene therapy vectors can be delivered to a subject by, for 
example, intravenous injection, local administration (Nabel and Nabel, US Patent No. 
5,328,470, 1994), or by stereotactic injection (Chen et ah, 1994). The pharmaceutical 
preparation of a gene therapy vector can include an acceptable diluent, or can comprise a 
slow release matrix in which the gene delivery vehicle is imbedded. Alternatively, where 
the complete gene delivery vector can be produced intact from recombinant cells, e.g., 
retroviral vectors, the pharmaceutical preparation can include one or more cells that 
produce the gene delivery system. 

9. Dosage 

The pharmaceutical composition and method of the present invention may further 
comprise other therapeutically active compounds as noted herein that are usually applied 
in the treatment of the above mentioned pathological conditions. 

In the treatment or prevention of conditions which require AAP modulation an 
appropriate dosage level will generally be about 0.01 to 500 mg per kg patient body 
weight per day which can be administered in single or multiple doses. Preferably, the 
dosage level will be about 0.1 to about 250 mg/kg per day; more preferably about 0.5 to 
about 100 mg/kg per day. A suitable dosage level may be about 0.01 to 250 mg/kg per 
day, about 0.05 to 100 mg/kg per day, or about 0.1 to 50 mg/kg per day. Within this 
range the dosage may be 0.05 to 0.5, 0.5 to 5 or 5 to 50 mg/kg per day. For oral 
administration, the compositions are preferably provided in the form of tablets containing 
1.0 to 1000 milligrams of the active ingredient, particularly 1.0, 5.0, 10.0, 15.0. 20.0, 
25.0, 50.0, 75.0, 100.0, 150.0, 200.0, 250.0, 300.0, 400.0, 500.0, 600.0, 750.0, 800.0, 
900.0, and 1000.0 milligrams of the active ingredient for the symptomatic adjustment of 
the dosage to the patient to be treated. The compounds may be administered on a 
regimen of 1 to 4 times per day, preferably once or twice per day. 

It will be understood, however, that the specific dose level and frequency of 
dosage for any particular patient may be varied and will depend upon a variety of factors 
including the activity of the specific compound employed, the metabolic stability and 
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length of action of that compound, the age, body weight, general health, sex, diet, mode 
and time of administration, rate of excretion, drug combination, the severity of the 
particular condition, and the host undergoing therapy. 
10. Kits for pharmaceutical compositions 

The pharmaceutical compositions can be included in a kit, container, pack, or 
dispenser together with instructions for administration. When the invention is supplied as 
a kit, the different components of the composition may be packaged in separate containers 
and admixed immediately before use. Such packaging of the components separately may 
permit long-term storage without losing the active components' functions. 

Kits may also include reagents in separate containers that facilitate the execution 
of a specific test, such as diagnostic tests or tissue typing. For example, AAP DNA 
templates and suitable primers may be supplied for internal controls. 

(a) Containers or vessels 

The reagents included in the kits can be supplied in containers of any sort such 
that the life of the different components are preserved, and are not adsorbed or altered by 
the materials of the container. For example, sealed glass ampules may contain 
lyophilized luciferase or buffer that have been packaged under a neutral, non-reacting gas, 
such as nitrogen. Ampoules may consist of any suitable material, such as glass, organic 
polymers, such as polycarbonate, polystyrene, etc., ceramic, metal or any other material 
typically employed to hold reagents. Other examples of suitable containers include 
simple bottles that may be fabricated from similar substances as ampules, and envelopes, 
that may consist of foil-lined interiors, such as aluminum or an alloy. Other containers 
include test tubes, vials, flasks, bottles, syringes, or the like. Containers may have a 
sterile access port, such as a bottle having a stopper that can be pierced by a hypodermic 
injection needle. Other containers may have two compartments that are separated by a 
readily removable membrane that upon removal permits the components to mix. 
Removable membranes may be glass, plastic, rubber, etc. 

(b) Instructional materials 

Kits may also be supplied with instructional materials. Instructions may be 
printed on paper or other substrate, and/or may be supplied as an electronic-readable 
medium, such as a floppy disc, CD-ROM, DVD-ROM, Zip disc, videotape, audiotape, 
etc. Detailed instructions may not be physically associated with the kit; instead, a user 
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may be directed to an internet web site specified by the manufacturer or distributor of the 
kit, or supplied as electronic mail. 

Screening and detection methods 

The isolated nucleic acid molecules of the invention can be used to express an 
AAP {e.g., via a recombinant expression vector in a host cell in gene therapy 
applications), to detect an AAP mRNA (e.g., in a biological sample) or a genetic lesion in 
an AAP, and to modulate AAP activity, as described below. In addition, AAP 
polypeptides can be used to screen drugs or compounds that modulate the AAP activity or 
expression as well as to treat disorders characterized by insufficient or excessive 
production of an AAP or production of AAP forms that have decreased or aberrant 
activity compared to an AAP wild-type protein, or modulate biological function that 
involve an AAP. In addition, the anti-AAP Abs of the invention can be used to detect and 
isolate an AAP and modulate AAP activity. 

1 . Screening assays 

The invention provides a method (screening assay) for identifying modalities, i.e., 
candidate or test compounds or agents (e.g., peptides, peptidomimetics, small molecules 
or other drugs), foods, combinations thereof, etc. , that effect an AAP, a stimulatory or 
inhibitory effect, inlcuding translation, transcription, activity or copies of the gene in 
cells. The invention also includes compounds identified in screening assays. 

Testing for compounds that increase or decrease AAP activity are desirable. A 
compound may modulate an AAP activity by affecting: (1) the number of copies of the 
gene in the cell (amplifiers and deamplifiers); (2) increasing or decreasing transcription of 
an AAP (transcription up-regulators and down-regulators); (3) by increasing or decreasing 
the translation of an AAP mRNA into protein (translation up-regulators and down- 
regulators); or (4) by increasing or decreasing the activity of an AAP itself (agonists and 
antagonists). 

(a) effects of compounds 

To identify compounds that affect an AAP at the DNA, RNA and protein levels, 
cells or organisms are contacted with a candidate compound and the corresponding 
change in an AAP DNA, RNA or protein is assessed (Ausubel et al, 1987). For DNA 
amplifiers and deamplifiers, the amount of an AAP DNA is measured, for those 
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compounds that are transcription up-regulators and down-regulators the amount of an 
AAP mRNA is determined; for translational up- and down-regulators, the amount of an 
AAP polypeptide is measured. Compounds that are agonists or antagonists may be 
identified by contacting cells or organisms with the compound, and then examining, for 
example, the model of angiogenesis in vitro. 

In one embodiment, many assays for screening candidate or test compounds that 
bind to or modulate the activity of an AAP or polypeptide or biologically-active portion 
are available. Ttest compounds can be obtained using any of the numerous approaches in 
combinatorial library methods, including: biological libraries; spatially addressable 
parallel solid phase or solution phase libraries; synthetic library methods requiring 
deconvolution; the "one-bead one-compound" library method; and synthetic library 
methods using affinity chromatography selection. The biological library approach is 
limited to peptides, while the other four approaches encompass peptide, non-peptide 
oligomer or small molecule libraries of compounds (Lam, 1997). 

(b) small molecules 

A "small molecule" refers to a composition that has a molecular weight of less 
than about 5 kD and more preferably less than about 4 kD, most preferably less than 0.6 
kD. Small molecules can be, nucleic acids, peptides, polypeptides, peptidomimetics, 
carbohydrates, lipids or other organic or inorganic molecules. Libraries of chemical 
and/or biological mixtures, such as fungal, bacterial, or algal extracts, are known in the art 
and can be screened with any of the assays of the invention. Examples of methods for the 
synthesis of molecular libraries can be found in: (Carell et al, 1994a; Carell et al, 1994b; 
Cho et al., 1993; DeWitt et al, 1993; Gallop et al, 1994; Zuckermann et al, 1994). 

Libraries of compounds maybe presented in solution (Houghten et al, 1992) or 
on beads (Lam et al, 1991), on chips (Fodor et al, 1993), bacteria, spores (Ladner et al, 
US Patent No. 5,223,409, 1993), plasmids (Cull et al, 1992) or on phage (Cwirla et al, 
1990; Devlin et al, 1990; Felici et al, 1991; Ladner et al, US Patent No. 5,223,409, 
1993; Scott and Smith, 1990). A cell-free assay comprises contacting an AAP or 
biologically-active fragment with a known compound that binds the AAP to form an 
assay mixture, contacting the assay mixture with a test compound, and determining the 
ability of the test compound to interact with the AAP, where determining the ability of the 
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test compound to interact with the AAP comprises determining the ability of the AAP to 
preferentially bind to or modulate the activity of an AAP target molecule. 

(c) cell-free assays 

The cell-free assays of the invention may be used with both soluble or a 
membrane-bound forms of an AAP. In the case of cell-free assays comprising the 
membrane-bound form, a solubilizing agent to maintain the AAP in solution. Examples 
of such solubilizing agents include non-ionic detergents such as n-octylglucoside, n- 
dodecylglucoside, n-dodecylmaltoside, octanoyl-N-methylglucamide, decanoyl-N- 
methylglucamide, TRITON® X-100 and others from the TRITON® series, THESIT®, 
Isotridecypoly(ethylene glycol ether) n , N-dodecyl-N,N-dimethyl-3-ammonio-l -propane 
sulfonate, 3-(3-cholamidopropyl) dimethylamminiol-1 -propane sulfonate (CHAPS), or 3- 
(3-cholamidopropyl)dimethylamminiol-2-hydroxy- 1 -propane sulfonate (CHAPSO). 

(d) immobilization of target molecules to facilitate screening 

In more than one embodiment of the assay methods, immobilizing either an AAP 
or its partner molecules can facilitate separation of complexed from uncomplexed forms 
of one or both of the proteins, as well as to accommodate high throughput assays. 
Binding of a test compound to an AAP, or interaction of an AAP with a target molecule 
in the presence and absence of a candidate compound, can be accomplished in any vessel 
suitable for containing the reactants, such as microtiter plates, test tubes, and 
micro-centrifuge tubes. A fusion protein can be provided that adds a domain that allows 
one or both of the proteins to be bound to a matrix. For example, GST- AAP fusion 
proteins or GST-target fusion proteins can be adsorbed onto glutathione sepharose beads 
(SIGMA Chemical, St. Louis, MO) or glutathione derivatized microtiter plates that are 
then combined with the test compound or the test compound and either the non-adsorbed 
target protein or an AAP, and the mixture is incubated under conditions conducive to 
complex formation {e.g., at physiological conditions for salt and pH). Following 
incubation, the beads or microtiter plate wells are washed to remove any unbound 
components, the matrix immobilized in the case of beads, complex determined either 
directly or indirectly, for example, as described. Alternatively, the complexes can be 
dissociated from the matrix, and the level of AAP binding or activity determined using 
standard techniques. 



-111 - 



Other techniques for immobilizing proteins on matrices can also be used in 
screening assays. Either an AAP or its target molecule can be immobilized using biotin- 
avidin or biotin-streptavidin systems. Biotinylation can be accomplished using many 
reagents, such as biotin-NHS (N-hydroxy-succinimide; PIERCE Chemicals, Rockford, 
IL), and immobilized in wells of streptavidin-coated 96 well plates (PIERCE Chemical). 
Alternatively, Abs reactive with an AAP or target molecules, but which do not interfere 
with binding of the AAP to its target molecule, can be derivatized to the wells of the 
plate, and unbound target or an AAP trapped in the wells by antibody conjugation. 
Methods for detecting such complexes, in addition to those described for the 
GST-immobilized complexes, include immunodetection of complexes using Abs reactive 
with an AAP or its target, as well as enzyme-linked assays that rely on detecting an 
enzymatic activity associated with the AAP or target molecule. 

(e) screens to identify modulators 

Modulators of AAP expression can be identified in a method where a cell is 
contacted with a candidate compound and the expression of an AAP mRNA or protein in 
the cell is determined. The expression level of the AAP mRNA or protein in the presence 
of the candidate compound is compared to the AAP mRNA or protein levels in the 
absence of the candidate compound. The candidate compound can then be identified as a 
modulator of the AAP mRNA or protein expression based upon this comparison. For 
example, when expression of an AAP mRNA or protein is greater (i.e., statistically 
significant) in the presence of the candidate compound than in its absence, the candidate 
compound is identified as a stimulator of the AAP mRNA or protein expression. 
Alternatively, when expression of the AAP mRNA or protein is less (statistically 
significant) in the presence of the candidate compound than in its absence, the candidate 
compound is identified as an inhibitor of the AAP mRNA or protein expression. The 
level of an AAP mRNA or protein expression in the cells can be determined by methods 
described for detecting an AAP mRNA or protein. 

(i) hybrid assays 

In yet another aspect of the invention, an AAP can be used as "bait" in two-hybrid 
or three hybrid assays (Bartel et al, 1993; Brent et al, WO94/10300, 1994; Iwabuchi et 
al, 1993; Madura et al, 1993; Saifer et al, US Patent No. 5,283,317, 1994; Zervos et al, 
1993) to identify other proteins that bind or interact with the AAP and modulate AAP 
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activity. Such AAP-bps are also likely to be involved in the propagation of signals by the 
AAP as, for example, upstream or downstream elements of an AAP pathway. 

The two-hybrid system is based on the modular nature of most transcription 
factors, which consist of separable DNA-binding and activation domains. Briefly, the 
assay utilizes two different DNA constructs. In one construct, the gene that codes for an 
AAP is fused to a gene encoding the DNA binding domain of a known transcription 
factor {e.g., GAL4). The other construct, a DNA sequence from a library of DNA 
sequences that encodes an unidentified protein ("prey" or "sample") is fused to a gene that 
codes for the activation domain of the known transcription factor. If the "bait" and the 
"prey" proteins are able to interact in vivo, forming an AAP-dependent complex, the 
DNA-binding and activation domains of the transcription factor are brought into close 
proximity. This proximity allows transcription of a reporter gene {e.g. , LacZ) that is 
operably-linked to a transcriptional regulatory site responsive to the transcription factor. 
Expression of the reporter gene can be detected, and cell colonies containing the 
functional transcription factor can be isolated and used to obtain the cloned gene that 
encodes the AAP-interacting protein. 

The invention further pertains to novel agents identified by the aforementioned 
screening assays and uses thereof for treatments as described herein. 
2. Detection assays 

Portions or fragments of an AAP cDNA sequences identified herein (and the 
complete AAP gene sequences) are useful in themselves. By way of non-limiting 
example, these sequences can be used to: (1) identify an individual from a minute 
biological sample (tissue typing); and (2) aid in forensic identification of a biological 
sample. 

(a) Tissue typing 

The AAP sequences of the invention can be used to identify individuals from 
minute biological samples. In this technique, an individual's genomic DNA is digested 
with one or more restriction enzymes and probed on a Southern blot to yield unique 
bands. The sequences of the invention are useful as additional DNA markers for 
"restriction fragment length polymorphisms" (RFLP; (Smulson et al, US Patent No. 
5,272,057, 1993)). 
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Furthermore, the AAP sequences can be used to determine the actual base-by-base 
DNA sequence of targeted portions of an individual's genome. AAP sequences can be 
used to prepare two PCR primers from the 5'- and 3'-termini of the sequences that can 
then be used to amplify an the corresponding sequences from an individual's genome and 
then sequence the amplified fragment. 

Panels of corresponding DNA sequences from individuals can provide unique 
individual identifications, as each individual will have a unique set of such DNA 
sequences due to allelic differences. The sequences of the invention can be used to obtain 
such identification sequences from individuals and from tissue. The AAP sequences of 
the invention uniquely represent portions of an individual's genome. Allelic variation 
occurs to some degree in the coding regions of these sequences, and to a greater degree in 
the noncoding regions. The allelic variation between individual humans occurs with a 
frequency of about once ever 500 bases. Much of the allelic variation is due to single 
nucleotide polymorphisms (SNPs), which include RFLPs. 

Each of the sequences described herein can, to some degree, be used as a standard 
against which DNA from an individual can be compared for identification purposes. 
Because greater numbers of polymorphisms occur in noncoding regions, fewer sequences 
are necessary to differentiate individuals. Noncoding sequences can positively identify 
individuals with a panel of 10 to 1,000 primers that each yield a noncoding amplified 
sequence of 100 bases. If predicted coding sequences, such as those in SEQ ID NOS:l, 3, 
5, 7, 9, 11, 13, and 15 are used, a more appropriate number of primers for positive 
individual identification would be 500-2,000. 

Predictive medicine 

The invention also pertains to the field of predictive medicine in which diagnostic 
assays, prognostic assays, pharmacogenomics, and monitoring clinical trials are used for 
prognostic (predictive) purposes to treat an individual prophylactically. Accordingly, one 
aspect of the invention relates to diagnostic assays for determining an AAP and/or nucleic 
acid expression as well as AAP activity, in the context of a biological sample {e.g., blood, 
serum, cells, tissue) to determine whether an individual is afflicted with a disease or 
disorder, or is at risk of developing a disorder, associated with aberrant AAP expression 
or activity, including cancer. The invention also provides for prognostic (or predictive) 
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assays for determining whether an individual is at risk of developing a disorder associated 
with an AAP, nucleic acid expression or activity. For example, mutations in an AAP can 
be assayed in a biological sample. Such assays can be used for prognostic or predictive 
purpose to prophylactically treat an individual prior to the onset of a disorder 
characterized by or associated with the AAP, nucleic acid expression, or biological 
activity. 

Another aspect of the invention provides methods for determining AAP activity, 
or nucleic acid expression, in an individual to select appropriate therapeutic or 
prophylactic agents for that individual (referred to herein as "pharmacogenomics"). 
Pharmacogenomics allows for the selection of modalities (e.g., drugs, foods) for 
therapeutic or prophylactic treatment of an individual based on the individual's genotype 
(e.g., the individual's genotype to determine the individual's ability to respond to a 
particular agent). Another aspect of the invention pertains to monitoring the influence of 
modalities (e.g., drugs, foods) on the expression or activity of an AAP in clinical trials. 

1. Diagnostic assays 

An exemplary method for detecting the presence or absence of an AAP in a 
biological sample involves obtaining a biological sample from a subject and contacting 
the biological sample with a compound or an agent capable of detecting the AAP or the 
AAP nucleic acid (e.g., mRNA, genomic DNA) such that the presence of the AAP is 
confirmed in the sample. An agent for detecting the AAP mRNA or genomic DNA is a 
labeled nucleic acid probe that can hybridize to the AAP mRNA or genomic DNA. The 
nucleic acid probe can be, for example, a full-length AAP nucleic acid, such as the nucleic 
acid of SEQ ID NOS: 1, 3, 5, 7, 9, 1 1, 13 or 15, or a portion thereof, such as an 
oligonucleotide of at least 15, 30, 50, 100, 250 or 500 nucleotides in length and sufficient 
to specifically hybridize under stringent conditions to an AAP mRNA or genomic DNA. 

An agent for detecting an AAP polypeptide is an antibody capable of binding to 
the AAP, preferably an antibody with a detectable label. Abs can be polyclonal, or more 
preferably, monoclonal. An intact antibody, or a fragment (e.g., F ab or F(ab') 2 ) can be 
used. A labeled probe or antibody is coupled (i.e., physically linking) to a detectable 
substance, as well as indirect detection of the probe or antibody by reactivity with another 
reagent that is directly labeled. Examples of indirect labeling include detection of a 
primary antibody using a fluorescently-labeled secondary antibody and end-labeling of a 
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DNA probe with biotin such that it can be detected with fluorescently-labeled 
streptavidin. The term "biological sample" includes tissues, cells and biological fluids 
isolated from a subject, as well as tissues, cells and fluids present within a subject. The 
detection method of the invention can be used to detect an AAP mRNA, protein, or 
genomic DNA in a biological sample in vitro as well as in vivo. For example, in vitro 
techniques for detection of an AAP mRNA include Northern hybridizations and in situ 
hybridizations. In vitro techniques for detection of an AAP polypeptide include enzyme 
linked immunosorbent assays (ELISAs), Western blots, immunoprecipitations, and 
immunofluorescence. In vitro techniques for detection of an AAP genomic DNA include 
Southern hybridizations and fluorescence in situ hybridization (FISH). Furthermore, in 
vivo techniques for detecting an AAP include introducing into a subject a labeled anti- 
AAP antibody. For example, the antibody can be labeled with a radioactive marker 
whose presence and location in a subject can be detected by standard imaging techniques. 

In one embodiment, the biological sample from the subject contains protein 
molecules, and/or mRNA molecules, and/or genomic DNA molecules. A preferred 
biological sample is blood. 

In another embodiment, the methods further involve obtaining a biological sample 
from a subject to provide a control, contacting the sample with a compound or agent to 
detect an AAP, mRNA, or genomic DNA, and comparing the presence of the AAP, 
mRNA or genomic DNA in the control sample with the presence of the AAP, mRNA or 
genomic DNA in the test sample. 

The invention also encompasses kits for detecting an AAP in a biological sample. 
For example, the kit can comprise: a labeled compound or agent capable of detecting an 
AAP or an AAP mRNA in a sample; reagent and/or equipment for determining the 
amount of an AAP in the sample; and reagent and/or equipment for comparing the 
amount of an AAP in the sample with a standard. The compound or agent can be 
packaged in a suitable container. The kit can further comprise instructions for using the 
kit to detect the AAP or nucleic acid. 
2. Prognostic assays 

The diagnostic methods described herein can furthermore be utilized to identify 
subjects having or at risk of developing a disease or disorder associated with an aberrant 
AAP expression or activity. For example, the assays described herein, can be used to 
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identify a subject having or at risk of developing a disorder associated with AAP, nucleic 
acid expression or activity. Alternatively, the prognostic assays can be used to identify a 
subject having or at risk for developing a disease or disorder. Tthe invention provides a 
method for identifying a disease or disorder associated with an aberrant AAP expression 
or activity in which a test sample is obtained from a subject and the AAP or nucleic acid 
(e.g., mRNA, genomic DNA) is detected. A test sample is a biological sample obtained 
from a subject. For example, a test sample can be a biological fluid (e.g., serum), cell 
sample, or tissue. 

Pognostic assays can be used to determine whether a subject can be administered 
a modality (e.g., an agonist, antagonist, peptidomimetic, protein, peptide, nucleic acid, 
small molecule, food, etc.) to treat a disease or disorder associated with an aberrant AAP 
expression or activity. Such methods can be used to determine whether a subject can be 
effectively treated with an agent for a disorder. The invention provides methods for 
determining whether a subject can be effectively treated with an agent for a disorder 
associated with an aberrant AAP expression or activity in which a test sample is obtained 
and the AAP or nucleic acid is detected (e.g., where the presence of the AAP or nucleic 
acid is diagnostic for a subject that can be administered the agent to treat a disorder 
associated with the aberrant AAP expression or activity). 

The methods of the invention can also be used to detect genetic lesions in an AAP 
to determine if a subject with the genetic lesion is at risk for a disorder characterized by 
aberrant angiogenesis. Methods include detecting, in a sample from the subject, the 
presence or absence of a genetic lesion characterized by at an alteration affecting the 
integrity of a gene encoding an AAP polypeptide, or the mis-expression of an AAP. Such 
genetic lesions can be detected by ascertaining: (1) a deletion of one or more nucleotides 
from an AAP; (2) an addition of one or more nucleotides to an AAP; (3) a substitution of 
one or more nucleotides in an AAP, (4) a chromosomal rearrangement of an AAP gene; 
(5) an alteration in the level of an AAP mRNA transcripts, (6) aberrant modification of an 
AAP, such as a change genomic DNA methylation, (7) the presence of a non-wild-type 
splicing pattern of an AAP mRNA transcript, (8) a non-wild-type level of an AAP, (9) 
allelic loss of an AAP, and/or (10) inappropriate post-translational modification of an 
AAP polypeptide. There are a large number of known assay techniques that can be used 
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to detect lesions in an AAP. Any biological sample containing nucleated cells may be 
used. 

In certain embodiments, lesion detection may use a probe/primer in a polymerase 
chain reaction (PCR) {e.g., (Mullis, US Patent No. 4,683,202, 1987; Mullis et al, US 

5 Patent No. 4,683,195, 1987), such as anchor PCR or rapid amplification of cDNA ends 

(RACE) PCR, or, alternatively, in a ligation chain reaction (LCR) (e.g., (Landegren et al, 
1988; Nakazawa et al, 1994), the latter is particularly useful for detecting point mutations 
in AAP-genes (Abravaya et al. , 1995). This method may include collecting a sample 
from a patient, isolating nucleic acids from the sample, contacting the nucleic acids with 
1 0 one or more primers that specifically hybridize to an AAP under conditions such that 

hybridization and amplification of the AAP (if present) occurs, and detecting the presence 
or absence of an amplification product, or detecting the size of the amplification product 
and comparing the length to a control sample. It is anticipated that PCR and/or LCR may 
be desirable to use as a preliminary amplification step in conjunction with any of the 

5 techniques used for detecting mutations described herein. 

Alternative amplification methods include: self sustained sequence replication 
(Guatelli et al., 1990), transcriptional amplification system (Kwoh et al., 1989); Qf3 
Replicase (Lizardi et al., 1988), or any other nucleic acid amplification method, followed 
by the detection of the amplified molecules using techniques well known to those of skill 

0 in the art. These detection schemes are especially useful for the detection of nucleic acid 

molecules present in low abundance. 

Mutations in an AAP from a sample can be identified by alterations in restriction 
enzyme cleavage patterns. For example, sample and control DNA is isolated, amplified 
(optionally), digested with one or more restriction endonucleases, and fragment length 

5 sizes are determined by gel electrophoresis and compared. Differences in fragment 

length sizes between sample and control DNA indicates mutations in the sample DNA. 
Moreover, the use of sequence specific ribozymes can be used to score for the presence of 
specific mutations by development or loss of a ribozyme cleavage site. 

Hybridizing a sample and control nucleic acids, e.g., DNA or RNA, to high- 

0 density arrays containing hundreds or thousands of oligonucleotides probes, can identify 

genetic mutations in an AAP (Cronin et al., 1996; Kozal et al, 1996). For example, 
genetic mutations in an AAP can be identified in two-dimensional arrays containing 
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light-generated DNA probes as described in Cronin, et al., supra. Briefly, a first 
hybridization array of probes can be used to scan through long stretches of DNA in a 
sample and control to identify base changes between the sequences by making linear 
arrays of sequential overlapping probes. This step allows the identification of point 
mutations. This is followed by a second hybridization array that allows the 
characterization of specific mutations by using smaller, specialized probe arrays 
complementary to all variants or mutations detected. Each mutation array is composed of 
parallel probe sets, one complementary to the wild-type gene and the other 
complementary to the mutant gene. 

In yet another embodiment, any of a variety of sequencing reactions known in the 
art can be used to directly sequence an AAP and detect mutations by comparing the 
sequence of the sample AAP-with the corresponding wild-type (control) sequence. 
Examples of sequencing reactions include those based on classic techniques (Maxam and 
Gilbert, 1977; Sanger et al., 1977). Any of a variety of automated sequencing procedures 
can be used when performing diagnostic assays (Naeve et al., 1995) including sequencing 
by mass spectrometry (Cohen et at., 1996; Griffin and Griffin, 1993; Koster, 
WO94/16101, 1994). 

Other methods for detecting mutations in an AAP include those in which 
protection from cleavage agents is used to detect mismatched bases in RNA/RNA or 
RNA/DNA heteroduplexes (Myers et al., 1985). In general, the technique of "mismatch 
cleavage" starts by providing heteroduplexes formed by hybridizing (labeled) RNA or 
DNA containing a wild-type AAP sequence with potentially mutant RNA or DNA 
obtained from a sample. The double-stranded duplexes are treated with an agent that 
cleaves single-stranded regions of the duplex such as those that arise from base pair 
mismatches between the control and sample strands. For instance, RNA/DNA duplexes 
can be treated with RNase and DNA/DNA hybrids treated with Si nuclease to 
enzymatically digest the mismatched regions. In other embodiments, either DNA/DNA 
or RNA/DNA duplexes can be treated with hydroxylamine or osmium tetroxide and with 
piperidine in order to digest mismatched regions. The digested material is then separated 
by size on denaturing polyacrylamide gels to determine the mutation site (Grompe et al, 
1989; Saleeba and Cotton, 1993). The control DNA or RNA can be labeled for detection. 
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Mismatch cleavage reactions may employ one or more proteins that recognize 
mismatched base pairs in double-stranded DNA (DNA mismatch repair) in defined 
systems for detecting and mapping point mutations in an AAP cDNAs obtained from 
samples of cells. For example, the mutY enzyme of E. coli cleaves A at G/A mismatches 
and the thymidine DNA glycosylase from HeLa cells cleaves T at G/T mismatches (Hsu 
et al, 1994). According to an exemplary embodiment, a probe based on a wild-type AAP 
sequence is hybridized to a cDNA or other DNA product from a test cell(s). The duplex 
is treated with a DNA mismatch repair enzyme, and the cleavage products, if any, can be 
detected from electrophoresis protocols or the like (Modrich et al, US Patent No. 
5,459,039, 1995). 

Electrophoretic mobility alterations can be used to identify mutations in an AAP. 
For example, single strand conformation polymorphism (SSCP) may be used to detect 
differences in electrophoretic mobility between mutant and wild type nucleic acids 
(Cotton, 1993; Hayashi, 1992; Orita et al, 1989). Single-stranded DNA fragments of 
sample and control AAP nucleic acids are denatured and then renatured. The secondary 
structure of single-stranded nucleic acids varies according to sequence; the resulting 
alteration in electrophoretic mobility allows detection of even a single base change. The 
DNA fragments may be labeled or detected with labeled probes. The sensitivity of the 
assay may be enhanced by using RNA (rather than DNA), in which the secondary 
structure is more sensitive to a sequence changes. The subject method may use 
heteroduplex analysis to separate double stranded heteroduplex molecules on the basis of 
changes in electrophoretic mobility (Keen et al, 1991). 

The migration of mutant or wild-type fragments can be assayed using denaturing 
gradient gel electrophoresis (DGGE; (Myers et al, 1985). In DGGE, DNA is modified to 
prevent complete denaturation, for example by adding a GC clamp of approximately 40 
bp of high-melting GC-rich DNA by PCR. A temperature gradient may also be used in 
place of a denaturing gradient to identify differences in the mobility of control and sample 
DNA (Rossiter and Caskey, 1990). 

Examples of other techniques for detecting point mutations include, but are not 
limited to, selective oligonucleotide hybridization, selective amplification, or selective 
primer extension. For example, oligonucleotide primers may be prepared in which the 
known mutation is placed centrally and then hybridized to target DNA under conditions 
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that permit hybridization only if a perfect match is found (Saiki et al, 1986; Saiki et al, 
1989). Such allele-specific oligonucleotides are hybridized to PCR-amplified target DNA 
or a number of different mutations when the oligonucleotides are attached to the 
hybridizing membrane and hybridized with labeled target DNA. 

Alternatively, allele specific amplification technology that depends on selective 
PCR amplification may be used. Oligonucleotide primers for specific amplifications may 
carry the mutation of interest in the center of the molecule (so that amplification depends 
on differential hybridization (Gibbs et al, 1989)) or at the extreme 3'-terminus of one 
primer where, under appropriate conditions, mismatch can prevent, or reduce polymerase 
extension (Prosser, 1993). Novel restriction site in the region of the mutation may be 
introduced to create cleavage-based detection (Gasparini et al, 1992). Certain 
amplification may also be performed using Taq ligase for amplification (Barany, 1991). 
In such cases, ligation occurs only if there is a perfect match at the 3 '-terminus of the 5' 
sequence, allowing detection of a known mutation by scoring for amplification. 

The described methods may be performed, for example, by using pre-packaged 
kits comprising at least one probe (nucleic acid or antibody) that may be conveniently 
used, for example, in clinical settings to diagnose patients exhibiting symptoms or family 
history of a disease or illness involving an AAP. 

Furthermore, any cell type or tissue in which an AAP is expressed may be utilized 
in the prognostic assays described herein. 

3. Pharmaco genomics 

Agents, or modulators that have a stimulatory or inhibitory effect on AAP activity 
or expression, as identified by a screening assay can be administered to individuals to 
treat, prophylactically or therapeutically, disorders, including those associated with 
angiogenesis. In conjunction with such treatment, the pharmacogenomics {i.e., the study 
of the relationship between a subject's genotype and the subject's response to a foreign 
modality, such as a food, compound or drug) may be considered. Metabolic differences 
of therapeutics can lead to severe toxicity or therapeutic failure by altering the relation 
between dose and blood concentration of the pharmacologically active drug. Thus, the 
pharmacogenomics of the individual permits the selection of effective agents (e.g., drugs) 
for prophylactic or therapeutic treatments based on a consideration of the individual's 
genotype. Pharmacogenomics can further be used to determine appropriate dosages and 
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therapeutic regimens. Accordingly, the activity of an AAP, expression of an AAP nucleic 
acid, or an AAP mutation(s) in an individual can be determined to guide the selection of 
appropriate agent(s) for therapeutic or prophylactic treatment. 

Pharmacogenomics deals with clinically significant hereditary variations in the 
response to modalities due to altered modality disposition and abnormal action in affected 
persons (Eichelbaum and Evert, 1996; Linder et ai, 1997). In general, two 
pharmacogenetic conditions can be differentiated: (1) genetic conditions transmitted as a 
single factor altering the interaction of a modality with the body (altered drug action) or 
(2) genetic conditions transmitted as single factors altering the way the body acts on a 
modality (altered drug metabolism). These pharmacogenetic conditions can occur either 
as rare defects or as nucleic acid polymorphisms. For example, glucose-6-phosphate 
dehydrogenase (G6PD) deficiency is a common inherited enzymopathy in which the main 
clinical complication is hemolysis after ingestion of oxidant drugs (anti-malarials, 
sulfonamides, analgesics, nitrofurans) and consumption of fava beans. 

As an illustrative embodiment, the activity of drug metabolizing enzymes is a 
major determinant of both the intensity and duration of drug action. The discovery of 
genetic polymorphisms of drug metabolizing enzymes (e.g., N-acetyltransferase 2 (NAT 
2) and cytochrome P450 enzymes CYP2D6 and CYP2C19) explains the phenomena of 
some patients who show exaggerated drug response and/or serious toxicity after taking 
the standard and safe dose of a drug. These polymorphisms are expressed in two 
phenotypes in the population, the extensive metabolizer (EM) and poor metabolizer (PM). 
The prevalence of PM is different among different populations. For example, the 
CYP2D6 gene is highly polymorphic and several mutations have been identified in PM, 
which all lead to the absence of functional CYP2D6. Poor metabolizers due to mutant 
CYP2D6 and CYP2C19 frequently experience exaggerated drug responses and side 
effects when they receive standard doses. If a metabolite is the active therapeutic moiety, 
PM shows no therapeutic response, as demonstrated for the analgesic effect of codeine 
mediated by its CYP2D6-formed metabolite morphine. At the other extreme are the so- 
called ultra-rapid metabolizers who are unresponsive to standard doses. Recently, the 
molecular basis of ultra-rapid metabolism has been identified to be due to CYP2D6 gene 
amplification. 
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The activity of an AAP, expression of an AAP nucleic acid, or mutation content of 
an AAP in an individual can be determined to select appropriate agent(s) for therapeutic 
or prophylactic treatment of the individual. In addition, pharmacogenetic studies can be 
used to apply genotyping of polymorphic alleles encoding drug-metabolizing enzymes to 
the identification of an individual's drug responsiveness phenotype. This knowledge, 
when applied to dosing or drug selection, can avoid adverse reactions or therapeutic 
failure and thus enhance therapeutic or prophylactic efficiency when treating a subject 
with an AAP modulator, such as a modulator identified by one of the described 
exemplary screening assays. 

4. Monitoring effects during clinical trials 

Monitoring the influence of agents (e.g. , drugs, compounds) on the expression or 
activity of an AAP (e.g., the ability to modulate angiogenesis) can be applied not only in 
basic drug screening, but also in clinical trials. For example, the effectiveness of an agent 
determined by a screening assay to increase an AAP expression, protein levels, or up- 
regulate an AAP's activity can be monitored in clinical trails of subjects exhibiting 
decreased AAP expression, protein levels, or down -regulated AAP activity. Alternatively, 
the effectiveness of an agent determined to decrease an AAP expression, protein levels, or 
down-regulate an AAP's activity, can be monitored in clinical trails of subjects exhibiting 
increased the AAP expression, protein levels, or up-regulated AAP activity. In such 
clinical trials, the expression or activity of the AAP and, preferably, other genes that have 
been implicated in, for example, angiogenesis can be used as a "read out" or markers for a 
particular cell's responsiveness. 

For example, genes, including an AAP, that are modulated in cells by treatment 
with a modality (e.g. , food, compound, drug or small molecule) can be identified. To 
study the effect of agents on angiogenesis, for example, in a clinical trial, cells can be 
isolated and RNA prepared and analyzed for the levels of expression of an AAP and other 
genes implicated in the disorder. The gene expression pattern can be quantified by 
Northern blot analysis, nuclear run-on or RT-PCR experiments, or by measuring the 
amount of protein, or by measuring the activity level of the AAP or other gene products. 
In this manner, the gene expression pattern itself can serve as a marker, indicative of the 
cellular physiological response to the agent. Accordingly, this response state may be 



- 123- 



determined before, and at various points during, treatment of the individual with the 
agent. 

The invention provides a method for monitoring the effectiveness of treatment of 
a subject with an agent (e.g. , an agonist, antagonist, protein, peptide, peptidomimetic, 

5 nucleic acid, small molecule, food or other drug candidate identified by the screening 

assays described herein) comprising the steps of (1) obtaining a pre-administration 
sample from a subject; (2) detecting the level of expression of an AAP, mRNA, or 
genomic DNA in the preadmini strati on sample; (3) obtaining one or more post- 
administration samples from the subject; (4) detecting the level of expression or activity 
1 0 of the AAP, mRNA, or genomic DNA in the post-administration samples; (5) comparing 

the level of expression or activity of the AAP, mRNA, or genomic DNA in the pre- 
administration sample with the AAP, mRNA, or genomic DNA in the post administration 
sample or samples; and (6) altering the administration of the agent to the subject 
accordingly. For example, increased administration of the agent may be desirable to 

5 increase the expression or activity of the AAP to higher levels than detected, i.e., to 

increase the effectiveness of the agent. Alternatively, decreased administration of the 
agent may be desirable to decrease expression or activity of the AAP to lower levels than 
detected, i.e., to decrease the effectiveness of the agent. 
5. Methods of treatment 

0 The invention provides for both prophylactic and therapeutic methods of treating a 

subject at risk of (or susceptible to) a disorder or having a disorder associated with 
aberrant AAP expression or activity. Furthermore, these same methods of treatment may 
be used to induce or inhibit angiogenesis, by changing the level of expression or activity 
of an AAP. 

5 6. Disease and disorders 

Diseases and disorders that are characterized by increased AAP levels or 
biological activity may be treated with therapeutics that antagonize (i.e., reduce or inhibit) 
activity. Antognists may be administered in a therapeutic or prophylactic manner. 
Therapeutics that may be used include: (1) AAP peptides, or analogs, derivatives, 

) fragments or homologs thereof; (2) Abs to an AAP peptide; (3) AAP nucleic acids; (4) 

administration of antisense nucleic acid and nucleic acids that are "dysfunctional" (i.e., 
due to a heterologous insertion within the coding sequences) that are used to eliminate 



- 124- 



endogenous function of by homologous recombination (Capecchi, 1989); or (5) 
modulators (i.e., inhibitors, agonists and antagonists, including additional peptide mimetic 
of the invention or Abs specific to an AAP) that alter the interaction between an AAP and 
its binding partner. 

Diseases and disorders that are characterized by decreased AAP levels or 
biological activity may be treated with therapeutics that increase (i.e., are agonists to) 
activity. Therapeutics that upregulate activity may be administered therapeutically or 
prophylactically. Therapeutics that may be used include peptides, or analogs, derivatives, 
fragments or homologs thereof; or an agonist that increases bioavailability. 

Increased or decreased levels can be readily detected by quantifying peptide 
and/or RNA, by obtaining a patient tissue sample (e.g., from biopsy tissue) and assaying 
in vitro for RNA or peptide levels, structure and/or activity of the expressed peptides (or 
AAP mRNAs). Methods include, but are not limited to, immunoassays (e.g., by Western 
blot analysis, immunoprecipitation followed by sodium dodecyl sulfate (SDS) 
polyacrylamide gel electrophoresis, immunocytochemistry, etc.) and/or hybridization 
assays to detect expression of mRNAs (e.g., Northern assays, dot blots, in situ 
hybridization, and the like). 

7. Prophylactic methods 

The invention provides a method for preventing, in a subject, a disease or 
condition associated with an aberrant AAP expression or activity, by administering an 
agent that modulates an AAP expression or at least one AAP activity. Subjects at risk for 
a disease that is caused or contributed to by an aberrant AAP expression or activity can be 
identified by, for example, any or a combination of diagnostic or prognostic assays. 
Administration of a prophylactic agent can occur prior to the manifestation of symptoms 
characteristic of the AAP aberrancy, such that a disease or disorder is prevented or, 
alternatively, delayed in its progression. Depending on the type of AAP aberrancy, for 
example, an AAP agonist or AAP antagonist can be used to treat the subject. The 
appropriate agent can be determined based on screening assays. 

8. Therapeutic methods 

Another aspect of the invention pertains to methods of modulating an AAP 
expression or activity for therapeutic purposes. The modulatory method of the invention 
involves contacting a cell with an agent that modulates one or more of the activities of 
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AAP activity associated with the cell. An agent that modulates AAP activity can be a 
nucleic acid or a protein, a naturally occurring cognate ligand of an AAP, a peptide, an 
AAP peptidomimetic, or other small molecule. The agent may stimulate AAP activity. 
Examples of such stimulatory agents include an active AAP and an AAP nucleic acid 

5 molecule that has been introduced into the cell. In another embodiment, the agent inhibits 

AAP activity. Examples of inhibitory agents include antisense AAP nucleic acids and 
anti-AAP Abs. Modulatory methods can be performed in vitro (e.g., by culturing the cell 
with the agent) or, alternatively, in vivo (e.g., by administering the agent to a subject). As 
such, the invention provides methods of treating an individual afflicted with a disease or 

0 disorder characterized by aberrant expression or activity of an AAP or nucleic acid 

molecule. In one embodiment, the method involves administering an agent (e.g., an agent 
identified by a screening assay), or combination of agents that modulates (e.g., up- 
regulates or down-regulates) AAP expression or activity. In another embodiment, the 
method involves administering an AAP or nucleic acid molecule as therapy to 

5 compensate for reduced or aberrant AAP expression or activity. 

Stimulation of AAP activity is desirable in situations in which AAP is abnormally 
down-regulated and/or in which increased AAP activity is likely to have a beneficial 
effect. One example of such a situation is where a subject has a disorder characterized by 
aberrant angiogenesis (e.g., cancer). 

0 9. Determination of the biological effect of the therapeutic 

Suitable in vitro or in vivo assays can be performed to determine the effect of a 
specific therapeutic and whether its administration is indicated for treatment of the 
affected tissue. 

In various specific embodiments, in vitro assays may be performed with 
5 representative cells of the type(s) involved in the patient's disorder, to determine if a 

given therapeutic exerts the desired effect upon the cell type(s). Modalities for use in 
therapy may be tested in suitable animal model systems including, but not limited to rats, 
mice, chicken, cows, monkeys, rabbits, and the like, prior to testing in human subjects. 
Similarly, for in vivo testing, any of the animal model system known in the art may be 
) used prior to administration to human subjects. 

10. Prophylactic and therapeutic uses of the compositions of the invention 
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AAP nucleic acids and proteins are useful in potential prophylactic and 
therapeutic applications implicated in a variety of disorders including, but not limited to 
those related to angiogenesis. 

As an example, a cDNA encoding an AAP may be useful in gene therapy, and the 
protein may be useful when administered to a subject in need thereof. 

AAP nucleic acids, or fragments thereof, may also be useful in diagnostic 
applications, wherein the presence or amount of the nucleic acid or the protein is to be 
assessed. A further use could be as an anti-bacterial molecule (i.e., some peptides have 
been found to possess anti-bacterial properties). These materials are further useful in the 
generation of Abs that immunospecifically bind to the novel substances of the invention 
for use in therapeutic or diagnostic methods. 

The following example is meant to not be limiting. 

Example 

Identification of genes differentially-regulated 

A comprehensive mRNA profiling technique (GeneCalling) was used to 
determine differential gene expression profiles of human endothelial cells undergoing 
differentiation into tube-like structures (Kahn et al, 2000). To confirm the expression 
data from GeneCalling, independent experiments were undertaken that used gene-specific 
PCR oligonucleotide primer pairs and an oligonucleotide probe labeled with a fluorescent 
dye at the 5' end and quencher fluorescent dye at the 3' end.. Total RNA (50 ng) was 
added to a 50 ul RT-PCR mixture and run. 
The following data were collected: 



hEF G collagen gel 24 hr versus 4h 4.5 fold upregulated 

hTRG collagen gel 24 hr versus 4h 3.5 fold upregulated 

KLP collagen gel 24 hr versus 4h 3.5 fold upregulated 

myosin X collagen gel 24 hr versus 4h 3.5 fold upregulated 

NHR collagen gel 24 hr versus 4h 7.3 fold downregulated 

HBAZF collagen gel 24 hr versus 4h 2.1 fold upregulated 



Equivalents 
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Although particular embodiments have been disclosed herein in detail, this has 
been done by way of example for purposes of illustration only, and is not intended to be 
limiting with respect to the scope of the appended claims that follow. In particular, it is 
contemplated by the inventors that various substitutions, alterations, and modifications 
may be made to the invention without departing from the spirit and scope of the invention 
as defined by the claims. The choice of nucleic acid starting material, clone of interest, or 
library type is believed to be a matter of routine for a person of ordinary skill in the art 
with knowledge of the embodiments described herein. Other aspects, advantages, and 
modifications considered within the scope of the following claims. 
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